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Symmetric Equilibria in Stochastic Timing Games 
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Abstract 

We construct subgame-perfect equilibria with mixed strategies for symmetric stochastic 
timing games with arbitrary strategic incentives. The strategies are qualitatively different 
for local first- or second-mover advantages, which we analyse in turn. When there is a local 
second-mover advantage, the players may conduct a war of attrition with stopping rates 
that we characterize in terms of the Snell envelope from the general theory of optimal 
stopping, which is very general but provides a clear interpretation. With a local first- 
mover advantage, stopping typically results from preemption and is abrupt. Equilibria 
may differ in the degree of preemption, precisely at which points it is triggered. We 
provide an algorithm to characterize where preemption is inevitable and to establish the 
existence of corresponding payoff-maximal symmetric equilibria. 

Keywords: Stochastic timing games, mixed strategies, subgame perfect equilibrium, op¬ 
timal stopping, Snell envelope. 
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1 Introduction 

The aim of this paper is to construct and understand subgame-perfect equilibria in symmetric 
stochastic timing games, which have important applications, for instance, in strategic real 
option models. It is well known that in many timing games in continuous time there exist 
no equilibria in pure strategies. If they do exist, however, they typically involve asymmetric 
payoffs that only depend on the respective roles of the players, which must be determined 
before the game starts. Then there is an unresolved strategic conflict. Here we strive for a 
rather general existence result and possibly symmetric payoffs, so we consider mixed strategies. 
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In particular, no assumption is made concerning the local incentives, which can move randomly 
between first- and second-mover advantages. 

Restricting attention to games with a second-mover advantage is known to be helpful for 
equilibrium existence. We begin by analyzing that case, too, demonstrating the general payoff 
asymmetry in pure-strategy equilibria. Our main contribution for this case is the construction 
of mixed strategy equilibria with symmetric payoffs in a possibly general model, making no 
specific assumptions concerning the underlyling uncertainty. Nevertheless, the equilibrium 
strategies have a clear characterization and interpretation, using the concept of the Snell 
envelope from optimal stopping theory. Specifically, we can describe the stopping rates that 
make a player indifferent to stay in the game when forgoing a profitable local payoff. Due to 
the possible uncertainty, the tradeoffs can be all in expected terms. Since we are not assuming 
any kind of smoothness or monotonicity of the underlying payoff processes, we also generalize 
existing results for purely deterministic models. 

With a first-mover advantage, there is often a preemption incentive that leads to equilib¬ 
rium existence problems even with mixed strategies in very simple, completely well-behaved 


deterministic models^ One needs to extend strategies to model preemption appropriately in 
continuous time, which requires some coordination mechanism (not to be confused with public 
correlation). As there is no respective “next period”, the players have to be able to ensure 
that the game ends instantaneously regardless, but without simultaneously sto pping to occu r 
with probability 1. We use the generalization of the co ncept of lFudenberg and Tirolel ()1985l ) 
to stochastic models provided in I Riede l and Steg (2014j), which gives us preemption equilibria 
that correspond to symmetric mixed equilibria in discrete time. These can be combined with 
the previous equilibria for local second-mover advantages with continuous strategies, to obtain 
a general existence and characterization of subgame-perfect equilibria with symmetric payoffs 
without any restriction on the order of payoff processes. 

These equilibria now allow for richer models of strategic real options, for instance, which 
typically focus on preemption. In the latter case one needs to ensure that waiting until 
preemption starts is optimal, however, which may hold in a specific model or requires a 


particular assumption^ Here we may have some continuous stopping beforehand. 

Depending on the profitability of future continuation equilibria, preemption does not have 
to be triggered just because there is a local first-mover advantage. As the term says, preemp¬ 
tion destroys future continuation payoffs; so the less often preemption occurs, the higher the 
resulting equilibrium payoffs. To determine at which times preemption is indeed inevitable, 
we provide an algorithm working under the additional assumption that simultaneous stopping 
is generally the worst outcome (weakly). Then we find that in any equilibrium with symmetric 


1 See . e.g..iFudenberg and_Tirol5 d 198511 an d lHendricks and Wilsonl (1 1 9921 1. 

2 See lRiedeTand Ste el d2014ll on s uch properties. For a stra tegic investment model where both preemption 
and attrition situations can arise see lStee and Thiissenl (120151 1. 
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payoffs in every subgame, the equilibrium payoff can never exceed the expected value of opti¬ 
mally stopping the minimum of the local leader and follower payoffs. That means, no matter 
how the players mix, possibly even with arbitrary public correlation and independently of 
infinite remaining time, they can never benefit from a high value of the underlying payoff 
processes if that is not attained by both the leader and follower payoffs simultaneously. 

Then we know that the game has to end by preemption whenever the leader payoff exceeds 
the equilibrium payoff bound. Iterating this procedure cumulates in the identification of times 
when preemption cannot be avoided. Confining preemption to those times, we obtain an 
equilibrium with least sustainable preemption and highest possible payoffs. 


1.1 Main theorems 


We use the formal concept o f subgame-perfect equilibria with mixed strategies for timing 
games developed in Riedel and Steg (2014), who argue that subgames are appropriately iden¬ 
tified by stopping times (the latter are feasible decision nodes, but cannot be represented by 
considering deterministic times only). Mixed strategies take the form of distribution functions 
over time, that can react to the dynamic exogenous information about the state of the world. 
We further apply the mentioned strategy extensions for preemption regimes. Uncertainty may 
affect the underlying payoffs, but may also (just) represent public correlation devices. Given 
that framework, this paper develops three main Theorems 15.4117.41 and 18.21 which build on 
each other. 

Theorem 15.41 constructs subgame-perfect equilibria with mixed strategies for games with 
a systematic (weak) second-mover advantage. Therefore the players have to coordinate on 
an appropriate payoff process, which consists of the leader payoff up to some feasible point, 
where there is either a simultaneous stopping equilibrium, or which is sufficiently late such 
that both players will then have stopped for sure. The equilibrium proceeds by optimally 
stopping this fixed process. As long as there are expected gains, no player stops. If a point is 
reached, however, where it would be strictly optimal to stop - i.e., any delay would imply a 
loss - then there has to be a compensation in terms of some probability to obtain the higher 
follower payoff. We characterize the exact rate that the respective opponent has to use to 
make each player indifferent at such points. 

Owing to its generality, this result is technically not as clear as its interpretation. With 
typical Brownian models, for instance, one cannot apply local arguments as there is no path 
monotonicity at all. Consequently, it is then also not possible to distinguish proper time 
intervals on which mixing occurs - imagine a Brownian motion fluctuating around the bound¬ 
ary of the region where mixing indeed takes place. Nevertheless, using martingale arguments 
we obtain a clear representation of strategies involving the concept of Snell envelope from 
the theory of optimal stopping, which allows us to speak meaningfully of a (local) expected 
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loss, for instance. These strategies will typically be continuous up to some terminal jumps. 
Another important question is then time consistency. If we define mixed strategies for all sub¬ 
games, i.e., stopping times, we have to ensure that they imply consistent conditional stopping 
probabilities throughout the game, which is generally not trivial. 

Theorem 17..‘II then makes use of the mentioned strategy extensions, which allow us to pro¬ 
vide symmetric preemption equilibria for regimes with a first-mover advantage. The theorem 
establishes that they form feasible continuation equilibria when leaving regimes with second- 
mover advantages. In aggregate we thus obtain payoff-symmetric equilibria for games without 
any restriction on the local incentives. There may be arbitrary, random alternations of first- 
or second-mover advantages. 

Theorem 18.21 determines efficient symmetric equilibria. While the previous ones involve 
extreme preemption - whenever there is a strict first-mover advantage - we now identify 
equilibria with least sustainable preemption, resulting in the highest feasible payoffs. For that 
purpose we focus on pay off-symmetric equilibria, with symmetric payoffs in every subgame, 
since this property has important implications for equilibrium strategies. Roughly, conditional 
stopping probabilities can only differ when players are currently indifferent to become leader 
or follower. With the additional assumption that simultaneous stopping is not strictly better 
than leading or following, in equilibrium the players can coordinate at most on optimally 
stopping the minimum of the leader and follower payoff processes. Whenever the leader 
payoff exceed that value, there must be preemption. Knowing this restricts the relevant 
stopping times in the previous problem, which further reduces the attainable value. Iterating 
the procedure formally as an algorithm identifies inevitable preemption points. 

Theorem 18. 2 1 establishes that we do obtain a well-defined equilibrium in the end, not only a 
limit value. It is based on the previous equilibria, but suppressing preemption where possible. 
The main problem is to show that there remain proper equilibria where preemption does take 
place, and that we indeed have well measurable, time-consistent strategies when applying the 
proposed algorithm to all subgames. 


1.2 Related literature 


Strategic timing problems appear in an abundance of contexts, in particular in economics but 
also in biology, e.g., and there is a vast related literature. 

On the one hand there is a branch on deterministic timing problems in continuous time 
addressing a wide range of applications, where typically a dist i ncti on is made between pre- 
emption models and w ars of attrition. Correspondingly, iHendricks and Wilsonl ()1992l f and 


Hendricks et al. 


(|l988l ) study stylized models w ith systematic first- and second -mover advan¬ 


tages, respectively. A war of attrition appears in Ghemawat and Nalebuffl (1 985 ) who consider 
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exit from a declining industry^ In a seminal contribution, Fudenberer and Tirole (19851 em¬ 
phasiz e subgame-perfection in a symmetric preemption game. iHoppe and Lehmann-Grub a 
0,2005) model a similar technology adoption game, allowing the leader payoff function to 
be multi-peaked while restricting the follower payoff to be nonincreasingo Without uncer¬ 
tainty, these games proceed quite linearly du e to perfec t foresight. More complications arise 


when the incentives may vary more freely. 


Laraki et al 


pQ05j) consider general deterministic 


IV-player games with payoffs that are just continuous functions of time (for given identities 
of first-movers). They prove that there do always exist e-equilibria, but not necessarily exact 
equilibria. 

On the other hand there is also a wide branch of the literature considering (continuous¬ 
time) timing games with uncertainty. Dutta and Rustichini (1993), e.g., formulate a sym¬ 
metric Markovian setting. However, restricting themselves to pure strategies, their Markov 
perfect equilibrium payoffs are generally asymmetric. 

Important applications with uncertainty are strategic real options. An early contri- 
b ution is ISmetsI (199 ill. A typica l symm etric model of preemptive investment is that of 


Mason and Weeds (2010 


while 


Weeds (2002|) considers strategic irreversible R&D investment, 


Murt o ( 200 4) studies exit from a duopoly. 


Finally, as we emphasize uncertainty, the literature on Dynkin games with its large tradi¬ 
tion has to be named. As these are two-person, zero-sum timing games, the classical question 
is the existence of an equilibrium saddle point, or val ue, under varying conditions. We here 
just refer to the more recent work by Touzi and Vieille (2002), since their payoff processes are 
very general and - more importantly - since they in troduce a nother concept of mixed strate¬ 
gies (but without consideration of subgames). Touzi and Vieille (2002) prove that many more 
Dynkin games have a value if one allows for such mixed strategies. 

Recently, also some more abstract wor k cons idering stochastic timing games with non- 
zero-sum payoffs has been conducted. Hamadene and Zhang ( 2010 1. e.g., prove existence of 
Nash equilibrium for 2-player games with a general second-mover advantage]^ 


iFudenberg and Tir olj ( 1986 1 analyse a market exit problem with incomplete information. 

iBulow cinTMempereTTl999li consider a similar problem with more than two firms. _ 

4 Some implication of F being a supermartingale will be di scussed here in Section 2] iDutta et all 


obtain again a similar structure as iFudenberg and Tirolel lll985l f (including the single-peakedness assumption) 
fro m a mode l of product diff erentiation. 

IPawlina and Kortl ll200filf cons ider a similar model with asymm etric investment costs and Thiisscn] (l2010h 
one with firm-specific uncertainty. lLambrecht and Perraudinl (l2003l l model preemption with incomplete infor¬ 
mation. 

6 See a lso Hamadene and Hassanil ( 2014 1 for an extension to N players using a similar approach. 
lLaraki and Solanl ~ l201^ 1 make less assumptions concerning the incentives in a 2-player game. Consequently, 
even allowing for mixed strategies, they can only prove existence of e-equilibria. 
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1.3 Outline 


This paper is organized as follows. In Section [2] we define our timing games, making only 
minimal regularity assumptions, and we introd uce th e con cept of subgame-perfect equilibria 
in mixed strategies as developed in Riedel and Steg (120141 1. 

Although we are generally working with mixed strategies, equilibrium verification is re¬ 
lated to solving optimal stopping problems by linearity, for which we establish a convenient 
representation in Section [31 There we also present the needed facts from the general theory of 
optimal stopping. On the one hand, strategies will be represented by the Snell envelope, which 
we motivate. On the other hand, in our games we have to be quite careful about existence of 
optimal stopping times, which depends strongly on path properties of the involved processes, 
so we will address some details. 

By a first application of this theory in Section [H we establish equilibria in pure strategies 
and argue that they typically generate coordination problems. These are resolved in Section [3] 
by the construction of subgame-perfect equilibria in mixed strategies, in a first step for games 
with a systematic second-mover advantage. Although our representation of the equilibrium 
strategies can be well interpreted, we derive a completely explicit equilibrium for a market 
exit example in Section [31 

In Section [7] we use the mentioned strategy extensions to obtain equilibria in regimes with 
first-mover advantages, which then enables us to construct and characterize subgame-perfect 
equilibria for arbitrary symmetric stopping games. Finally we identify equilibria with maximal 
payoffs and least possible preemption in Sectional Section [U] concludes. The appendix contains 
some technical results and the proofs. 


2 The timing game 


We use_ the framework for subgame-perfect equilibria with mixed strategies developed in 
Riedel and Steg (2014), where the concepts summarized in this section are explained in more 
detail. Here we only consider symmetric games, which allows some simplifications incorpo¬ 
rated in the following. 

The timing game consists of two players i = 1,2, who each decide when to stop in continu¬ 
ous time t G [0, oo]. However, there is uncertainty about the state of the world, modeled by a 
probability space (R, P). The partial information about the true state evolves exogenously 

over time, represented by a filtration o- The player’s stopping decisions may of course 

use this information, so a feasible plan is in principle a stopping time; see Section 12.11 for the 
formal definition of strategies. 

As usual in timing games, we focus only on situations (resp. histories) in which no player 
has stopped, yet. Therefore, the game ends as soon as any of the players stops. A player who 
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is the single one to stop first is called the leader. In this case the other player becomes the 
follower. Their respective payoffs are determined by the two stochastic processes L and F. 
Both processes incorporate the possible effect of an (optimal, contingent) stopping decision 
that the follower might have in a more primitive model, given that the opponent has already 
stopped - as in Example 12. Ml below. If the game ends by both players stopping simultaneously, 
their payoffs are determined by the third process M. All processes are measured in the same 
numeraire, say, discounted to time 0, and the players are risk neutrally 

Equilibria will obviously be based on solving optimal stopping problems involving the 
three underlying payoff processes. We need to make some weak regularity assumptions in 
order to have well defined problems in the following. 

Assumption 2.1. 

(i) (f is a fixed probability space equipped with a filtration F = {&t) t>Q satisfying 
the usual conditions (i.e., F is right-continuous and complete). 

(ii) The processes L, F and M are adapted, right-continuous (a.s.) and of class (D), M 
having an extension with E'[|M 00 |] < oo. 

(iii) min(L, F) is upper-semi-continuous from the left in expectation, in fact on [0, oo] if we 
put L 0 Q FqQ MqQ. 

Remark 2.2. 

(i) The payoff processes L, F and M do not have to be random; deterministic ones are just 
a special case. Even then the probability space and filtration might be nontrivial and 
represent possible public randomization devices. The current payoffs at any time t just 
should be known by the public information ,Pf 

(ii) Two important general technical issues are measurability, in particular concerning strate¬ 
gies that we address below, and integrability. We need to ensure that expectations are 
always well defined and that pointwise converging random variables converge in expec¬ 
tation, too. Class (D) is the possibly weakest integrability condition we can work with§| 
Boundedness would be much too strong for many applications (e.g., involving Brownian 
motion). 

Alternatively, one can interpret the payoff processes as measured in discounted “utils”. 

S A measurable process X is of class (D) if the family {X T : r < oo a.s. a stopping time} is uniformly 
integrable. Then the family is bounded in expectation and pointwise convergence of X at a stopping time implies 
convergence in F(P) as well. This is a mild regularity condition implied, e.g., by either P[sup t |W|] < oo or 
sup T E[\X T \ P ] < 00 for some p > 1. We may equivalently define any Xoo £ L 1 (P) and consider all stopping 
times (possibly taking the value oo) in the previous set; cf. Lemma I A. II 
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(iii) It depends on the model whether there is a natural payoff if both players “never stop”, 
which may be some limit of M or of L. In the latter case we simply set := L^ and 
work with M,^ for a unified payoff notation. For convenience, we also formally define 

Foo := Mqo. 


(iv) In order to have any general existence results for equilibria, some path regularity of the 
payoff processes is necessary, as can be seen clearly even in the deterministic, single 
agent case. Nevertheless, it suffices for us to have upper-semi-continuity from the left 
only in expectation^ In optimal stopping problems this property is also needed for 
existence. We use it for equilibria in mixed strategies when there is a (local) second- 
mover advantage. It is of course only required for L if that never exceeds F. Indeed, 
one could restrict attention to intervals [r, inf{t > r\L t > F t }\, where r is a stopping 
time; the area {L > F} is only relevant at transitions. The assumption is satisfied, e.g., 


if the paths of L and F are a.s. (upper-semi-)continuous from the left 


0 


Example 2.3. Let us consider a market exit problem as a simple example for a stochastic 
timing game with second-mover advantage, i.e., F > L, like in the classical war of attrition 0 

Suppose that two firms are operating in one market such that duopoly returns ir D might 
not be sustainable in the long run, depending on uncertain exogenous conditions. While each 
firm would in general like the opponent to leave the market in order to earn the monopoly 
profit ir M > 7 t d , it might be too costly to wait for that possibly random event. Each firm 
thus decides on times when waiting becomes no longer promising, and at which to leave the 
market if the other is still present. 

The payoff processes are then: 


L t = M t := [ it f ds, 
Jo 


F t := L t + ess sup E 

' rr F 

[ 7 T f ds 


t<T F && 

Ut 

- 


for all t € [0, oo] (the convention F 00 = Jll m = L ^ here holds naturally). Any monopolist 
may drop out at a stopping time , since the monopoly return need not be profitable, either; 
then immediate exit is the dominant strategy and the second-mover advantage will not be 
strict. However, we do not model the strategy of a single remaining firm, but incorporate 
the corresponding optimal decision in the payoff processes. The idea of subgame perfection 

Upper-semi-continuity from the left in expectation means E ( T A F T j ^ lim s11p r; E^Lj- n A Fr n ] for any 
sequence of stopping times (t„) that is a.s. increasing to a stopping time r. 

10 Then lim sup s ^, t (L s A F s ) < (limsup s ^, t L a ) A (limsup s ^, t F s ) <L t AF t for all t £ [0, oo] a.s., and we note 

that L and F are of class (D). _ _ 

11 For typical examples of preemption type, see iRiedel and Steel (2014). 
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requires that the latter be chosen optimally. 

Assumption ^. II is satisfied in this example if n D and tt m are adapted and P®dt-integrable, 
as all processes are then bounded by an integrable random variable. It follows from our 
discussion in Section ixn below that there exists a right-continuous process F such that the 
relation m holds even when substituting t by a general stopping time r, which is one of the 
most important results in continuous-time stopping^ 


2.1 Mixed strategies and equilibrium concept 


The concept of subgame-perfect equilibrium for stochastic timing games of Riedel and Steg 
(2014) is as follows. The feasible decision nodes in continuous time are all stopping times. 
Therefore we consider any stopping time as the beginning of a subgame, with the connotation 
that no player has stopped before. Let 8F denote the set of all stopping times w.r.t. our 
filtration F. 

We hence specify complete plans of actions for all subgames, taking the form of (random) 
distribution functions over time. In order to aggregate strategies for the whole game, one 
requires time consistency, meaning that Bayes’ law has to be respected wherever it applies. 

Additional strategy extensions are needed for subgames with first-mover advantages, to 
model preemp tion appropriately in continu ous timeF^l Therefore we use the generalization of 
the co ncept of lFudenberg and Tirolel (|l985T) to stochastic models developed in lRiedel and Steg 
(2014) - which preserves the interpretation of discrete-time limits. These extensions are 
introduced immediately, although one can abstract from them in the discussion of games with 
a second-mover advantage. We will take them up later for general games. 


Definition 2.4. An extended mixed strategy for player i G {1,2} in the subgame starting at 
id G ST , also called id-strategy, is a pair of processes (Gf, af) taking values in [0,1], respectively, 
with the following properties. 

(i) Gf is adapted. It is right-continuous and nondecreasing with Gf(t) = 0 for all t < id, 
a.s. 

(ii) af is progressively measurable 0 It is right-continuous where af < 1, a.s 

12 This issue is somewhat more delicate for L if we have an example in which the leader’s payoff also depends 
on the stopping time eventually chosen by the follower, as in an entry model. The follower’s decision involves of 
course no optimality condition with respect to the leader’s payoff stream, which may make the leader’s payoff 
discontinuous from the right in expectation (which optimality exactly prevents for the follower’s payoff). In 
general that pro blem will not arise i n diff usion models, however. 

13 See also iHendricks and Wilsonl (1 19921 1 on (non-)existence of equilibria in deterministic preemption games. 

14 Formally, the mapping af: x [0, £] —s- R, (w, s) i-» af(uu,s) must be St ® S([0, t])-measurable for any 
t € R+. It is a stronger condition than adaptedness, but weaker than optionality, which we automatically have 
for Gf by right-continuity. Progressive measurability implies that af (r) will be ^V-measurable for any r G S. 

15 This means that with probability 1, af (•) is right-continuous at all t £ [0, oo) for which af (t) < 1. Since 
we are here only interested in symmetric games, we may demand the extensions af(-) to be right-continuous 
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(iii) 

af (t) > 0 =$■ Gf (t) = 1 for all t > 0, a.s. 

We further define Gf (0—) = 0, Gf(oo) = 1 and af(oo) = 1 for every extended mixed strategy. 

Note that we do not require the players to stop in finite time. Then player i may for 
instance decide to stop simply at some stopping time r > d, which is interpreted as a pure 
strategy and corresponds to Gf(t) = 1 t> T for all t > 0 (and af = lt>oo). If af = 0 on 
[0, oo), we loosely speak of a “standard” mixed strategy. Such extended mixed strategies are 
completely equivalent to mixed strategies in the analogous model without extensions af. 

Definition 2.5. Given two extended mixed strategies (Gf,af), (Gf,af), i,j £ {1.2}, i ^ j, 
the payoff of player i in the subgame starting at d £ ST is 


Vf (Gf,af,Gj,Oj) :=E 


f (1 -Gf(s))L s dGf(s)+ [ (1 -Gf(s))F s dGf(s) 


[o,^) 


+ ^2 (s)AGj ( s)M s + A+ A L,jF?# + 
se[o ,f tf ) 




At f" := inf{t > i!}\af(t) + af(t) > 0}, the extensions af determine hnal outcome 
probabilities Af Af • and A f,. Their definition is given in Appendix [C] for completeness; it 
is a simplification of that in Riedel and Steg (2014), thanks to the slightly stronger regularity 
here. Note that if both players reserve some mass for t = oo (whence f d = oo, Af ^ = \f j = 0), 
the corresponding payoff will be (l — Gf{ oo— ))(1 - Gj( oo— ))Afoo, since we have defined 
Gf (oo) = af (oo) = 1. 

The pathwise integrals do include possible jumps of the right-continuous integrators at 0, 
since player i can become leader/follower from an initial jump of Gf/Gf, respectively. The 
payoffs are indeed well defined under Assumption 12.II and bounded in expectation - uniformly 
across all feasible strategies; cf. Lemma IA.2I 

To aggregate ^-strategies across subgames, time consistency in the form of Bayes’ law has 
to hold. 


Definition 2.6. An extended mixed strategy for player i £ {1,2} in the stopping game is a 
family 

(Gi,Oi) := ( Gf,af 
of extended mixed strategies for all subgames d £ EF. 

also w here they take the value 0, which simplifies the definition of outcomes. See Section 3 of lR.iedel and Steal 
(120141 ) for issues with asymmetric games and corresponding weaker regularity restrictions. 
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An extended mixed strategy (Gj, ctj) is time-consistent if for all i? < £ ST 


< t G M+ =► G?(t) = Gf ($'—) + (1 - Gf(0'-))Gf (t) a.s. 

and 

if < r £ J 4 af(r) = af (r) a.s. 


Note that time consistency implies in particular that for any two subgames il, i? 7 £ ^ we 
must have Gf = Gf* (a.s.) on the event {$ = i? 7 }, as one would reasonably expect. The 
equilibrium concept is then natural. 

Definition 2.7. A subgame-perfect equilibrium for the timing game is a pair (Gi, a.f) , (G 2 , 02 ) 
of time-consistent extended mixed strategies such that for all 1 ? £ i,j £ {1,2}, i / j, and 
extended mixed strategies (Gf,af) 


-\r'd(s~vd /oi? 

(^Z 5 ^Z 5 5 


a. 


)>^(Gf,af,G>f) a.s. 


i.e., such that every pair (Gf,af), (Gfjof) is an equilibrium in the subganre at 7 ? G 
respectively. 


3 Best replies and optimal stopping 

The payoffs in Definition 12.51 are apparently linear in strategies, in this section we derive a 
more explicit representation of this linearity, which will be very helpful for rigorous proofs to 
verify equilibria, but also for necessity arguments. To determine or verify any best replies, 
one needs to maximize over (extended) mixed strategies against these same objects in general, 
of course. Here we make related statements such as “any stopping time in the support of the 
mixed strategy needs to be optimal” precise. We further introduce the central concepts from 
the theory of optimal stopping, notably the Snell envelope, which plays a crucial role in the 
following representation and interpretation of mixed strategies in equilibrium. 

The following arguments concern the distributions Gf, so we neglect the extensions until 
Section [7] for simplicity (which formally means restricting to af = lj>oo, as mentioned in 
Section EH he., to “standard” mixed strategies). 

Now, for the alternative representation of the payoff of player i G {1,2} in the subgame 
at d G 2F, we introduce the process Sf given by 

sf(t) := [ F s dGj(s) + A Gf(t)M t + (l - G%t))L t (3.1) 

for all t G [0, 00 ), where Gf is a given feasible mixed strategy for the opponent j G {1, 2} \ i. 
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Lemma IA.3I shows that this process is well behaved: it is optional and of class (D). With 
Moo £ ^f{P), we can extend the definition of Sf in m to t = oo implying also Sf(oo) £ 
L 1 (P)0 

Thanks to Lemma [A.21 and Sf(oo) £ L 1 (P), we can integrate Sf by any feasible dGf to 
see that the expected payoff of player i in the subgame beginning at d £ FT can be written 


V?(Gf,Gf) = E 


Sf(t)dGf(t) 




(3.2) 


J [0,oo] 

As is to be expected by the linearity of dm there exists a best reply only if there is one 
that is a pure strategy. 

Lemma 3.1. Fix d £ FT and let Gf and Gf be feasible. Then 


Vf(Gf, Gf) < ess sup E [Sf (r) | &#] a.s. 




Proof: In Appendix lB.il 

From the lemma (and more explicitly from its proof) we see that a feasible strategy Gf 
will be a best reply to Gf if and only if for any stopping time r* such that dGf(r*) > 00 


E[Sf(r*)\^\ >E[S?(t)\&o] Fd<r£FT 


and we generally have to solve the optimal stopping problem on the right in Lemma l3.ll 
The central aspect of continuous-time games of timing is their inherent discontinuity, even 
if the underlying data (here L, F, and M) is continuous. For instance, from the definition 
of Sf in (13.11) it is now immediately clear that a best reply cannot put any joint mass points 
where F > M, since lim^o Sf (t + e) — Sf (t) = A Gf(t)(F t — M t ) by right-continuity of L; this 
will be a frequent argument. Depending on Gf, there need not exist any stopping time that 
actually attains the value of the problem, as Sf may have various kinds of discontinuities. 
Dealing with such discontinuities will be one of our major issues. 

In the following subsection we present some crucial facts from the general theory of optimal 

16 This means that Sf is measurable w.r.t. the optional cr-field on the product space SI x R+, which is 
generated by all right-continuous adapted processes, or equivalently by the random intervals [0, r), r £ &. 
17 Note that the integral in <EQ converges, as it is bounded by J ^ |F S | dGf(s) £ Gi^P) thanks to Lemma 

roi 

18 An application of Fubini’s theorem using footnote |TT] yields in particular 


[ (l -Gf(s))F s dGf(s)= [ [ 1 t>s dGt(t)F s dGf(s) 

J [0,oo) J [0,oo) ^ [0,oo] 


-/ / 

J ro,ooi J ro 


1 s<t F s dGf{s)dGf{t) = 


[0,oo] J [0,oo) 


/ / 

J [0,oo] J [0,£) 


F s dGf (s) dGf (t) £ L^P). 


19 This means that Gf (t) > Gf (r—) for all t > t a.s. 
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stopping in continuous time, providing in particular sufficient (and basically necessary) con¬ 
ditions for the existence of optimal stopping times and their characterization in terms of the 
Snell envelope. The latter is in fact our main tool to derive and represent mixed equilibrium 
strategies. 


3.1 Optimal stopping in continuous time 

As a motivating stopping problem to present the theory, consider the unilateral problem 
of when to become the leader optimally, i.e., supposing the opponent will never act. This 
problem will play an important role in the following^ 

It is well established how to characterize the solution of the optimal stopping problem 


Vl( 0) := ess sup E[L T ] 

r&sr 

given Assumption 12.11 In fact, our payoff process L is right-continuous (hence optional) and 
of class (D), such that we can apply the general theory of optimal stopping as in, e.g., 


Mertens 


(1972) and Bismut and Skallil (1977): There exists a smallest supermartingale Uj, dominating 
the payoff process L, called the Snell envelope of L, which satisfies 


Ul(&) = ess sup E [L t \ &&] a.s. 
■d<T&5T 


(3.3) 


for all stopping times $ £ . In particular Ul{ 0) = Vl{ 0). We remark that one can very 

well define the RHS of (13.31) for any •& € U?, but the key insight is that there exists a well 
behaved process Ul = (UL(t)) t> 0 , which one can evaluate at any stopping time i? to know 
the continuation value there. In view of the dynamic programming principle, we do need to 
consider continuation problems at stopping times; the latter are feasible quantities, but much 
richer than deterministic times. 

Now Ul is optional and of class (D) as welj 2 *] and such supermartingales have very conve¬ 
nient regularity properties: There exists a Doob-Meyer decomposition 22 ! 


U L = M l - D L 

that we extensively use, with a uniformly integrable, right-continuous martingale@ Ml and 
a nondecreasing, predictable and integrable process D] J . The latter can be interpreted as 
measuring the expected loss from stopping too late : If we postpone any stopping to r > 0, 


20 To stay in the framework of the game, for finding a (pure) best reply to G° = lt>oo we have to use the 
payoff Mqq for not stop ping in finite time. Recall our convention of setting Loo = Moo, however. 


21 


See 
2 See 


Mertens 


(1972 ), Theoreme T4 for the existence and Theoreme T5 and proof for Ul being of class (D). 


Mert ens (jl£72j), Theoreme T3. 


23 Therefore, the crucial optional sampling holds: Mn(cr) = E[Ml(t) \ J? CT ] for all a < r £ ST. Further, Ml 
has a last element Ml( oo) to which it converges in L 1 (P). 
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then we cannot achieve more than E[Ul(t)\ = Ul( 0 ) — E[Dl(t)\, even if we stop optimally 
from r onwards. 

Reflecting the dynamic programming principle, the value process Ul is a martingale as long 
as there still exists a future time r giving at least the same value in expectation as stopping 
immediately. Whether there exists any optimal stopping time depends on the continuity 
properties of Dl- If L is upper-semi-continuous in expectation (as by Assumption I2.1|(iii) if 
L < F, e.g.), Dl has left-continuous paths a.s@ By right-continuity of L, Dj, will be even 
continuous! 25 ! With Dl left-continuous, there exist the optimal stopping times 26 ! 

r£,($) := inf{f > i? | Ul{P) = L t } and := inf{f > $ | Z?i(t) > Dl ($—)}• (3-4) 

They are the respectively smallest and largest stopping times after $ € F? attainin J 2 ^! 


Ul($) = E[L Tl w | jy = E[L t l w | a.s. 


(3.5) 


Hence, by optimality it must hold that Ul = L a.s. at any point of increase of Dl, which 
implies 28 ! 

poo 

/ (U L (t) - L t ) dD L (t) = 0 a.s. (3.6) 

Jo 


4 Equilibria in pure strategies 

In symmetric games with a systematic second-mover advantage F > L, it is straightforward 
to identify certain subgame-perfect equilibria in pure strategies. Player j, say, just has to stop 
sufficiently late, such that i will solve the problem of optimally stopping L presented in Section 
cm We show in this section that such pure strategy equilibria typically entail asymmetric 

24 See iBismut and Skallil (119771 1. Theoreme II.2 and proof. (Semi-) Continuity in expectation is in general 
weaker than the corresponding path property from the left. 

Our payoff processes are not necessarily positive. However, if L is optional and of class (D), the same will be 
true for its negative part L~ := max(— L, 0), which thus has a Snell envelope U L ~ = M L - — D L - decomposing 
into a uniformly integrable right-continuous martingale M L - and an integrable increasing process D L ~. Then 
M l ~ — L~ > 0, implying L + M L - > 0. Adding the martingale M L - neither affects L being optional, of class 
(D), or (semi-) continuous in exp ectation, nor any optimal stopping times for L. 

25 See [Bismut and Skallil (1 19771 ). (2.15), where right-continuity of the payoff process implies in fact Z + = X. 

26 Example: L not upper-semi-continuous => inf {Dl > 0} not optimal. 



27 See [Bismut and Skallil (119771 '). Theoreme II.3. 

28 If Dl is continuous, Ul inherits right-continuity from Ml- Then, by 113.411 . (13.511 and right-continuity of 
Ul — L, inf{t £ R + | J Q 1 UL - L >e cIDl > 0} = oo a.s. for any e > 0, i.e., Ul — L < e (IDl- a.e. with probability 
one, implying the claim. (1X1)1) also holds without right-continuity of Ul — L, if L is upper-semi-continuous in 
expectation; see Remark IB. II in the appendix. 
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payoffs, however. The respective roles of the players have to be determined before the game 
starts, and correspondingly who obtains the higher payoff. With mixed strategies that we 
will consider thereafter, one can obtain equilibria with symmetric payoffs that do not create 
another strategic conflict outside the model. 

Stopping sufficiently late to support a pure strategy equilibrium need not be “never” as 
in the previous section; whenever it is optimal to stop L, it must not be worthwhile for i to 
wait until j stops, in order to become follower then. This will be the case, e.g., if j stops only 
at times where F = L - or simply at oo. 

The easiest example is thus G*j = 1 t =OQ and Gf = l t>TL ^ for all d £ or analogously 
with t l (-) defined in (13.41) . In either case waiting is indeed optimal for j on [0, oo), because 
there are expected gains of L on any interval [i?, t l ($)], and F dominates L at both 
and r L (d). 

There can also be quite complex patterns based on the same logic, but with players 
switching roles across subgames. This can be illustrated best with a little more structure 
as in Example 12.31 where the follower’s optimal stopping times will be “sufficiently late” for 
an equilibrium. However, the arguments generalize a bit: the exploited properties are that 
F > L > M and that F is a super martingale, i.e., that one becomes follower the sooner the 
better. Then stopping before tl($) is dominated (for any G'J). 

Lemma 4.1. Suppose F > L > M and that F is a supermartingale. Fix id € F?. For any 
feasible Gj and stopping time Ti > d, 


and 


E 


Sf((Ti V T L (■&))+) 




> E 


sr 



E 


Sf(r L (d)+) 




> E 


L 


Tl(0) 



a.s. 


Both inequalities also hold with r L (d). 

Proof: In Appendix lB.il 

Note that the supermartingale property of F is important for the result, to ensure relatively 
high payoffs in case one becomes follower before the optimum of L is reached. It is not sufficient 
that there are even strictly better future stopping times for L and that F > L: If Gj puts 
mass between d and tl (d) where F still dominates L , but where both are very low, then it 
may be worthwhile to secure the current payoff L$ due to the risk of becoming follower while 
waiting for the optimum of L. An alternative condition would be that L is a submartingale 
on [d,T L (d)]. 

In Example 12.31 L = f 0 tt d ds is the duopolists’ payoff process. Then the optimal stopping 
times in the follower’s problem are sufficiently late to support an equilibrium: the perspective 
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to become follower (monopolist) at a time when immediate exit is optimal has no value, and it 
leads to ceding when n D seems an unsustainable loss - at Tp(id). Indeed, as a monopolist stops 
/ 0 7 t a/ ds with ir M > ir D , that optimal stopping time satisfies tf( id) > tl(i?). Furthermore, 
it holds that F = L = M a.s. at Tp(id), so in particular simultaneous stopping is feasible 
on {Ti('d) = Tp (■$)} by F = M. These properties generate a whole class of equilibria with 
varying roles of the players, decided by events C at Tp(id). 

Proposition 4.2. Suppose F > L > M and that F is a supermartingale. Fix id £ FT and 
consider a stopping time Tp(id) > a.s., such that at Tp(id) we have F = L, and more 

specifically F = M on the subset { Tp (i?) = rplfid)} a.s. - e.g., Tpfd) := inf{f > id \F t = M t }. 
Then, for any given event C £ the pure strategies corresponding to 

Ti* = r L (id)lc + Tp(id)lcc and r 2 * = r L (id) 1^ + Tp(id)l c 

form an equilibrium in the subgame beginning at id. 

Proof: In Appendix lB.il 

Equilibria in pure strategies typically involve asymmetric payoffs, for instance if F > L at 
Tp(id) in those that we have specified. Consequently, there arises a coordination problem before 
the start of the game, each player wanting to become follower eventually. This problem is even 
aggravated in the equilibria of Proposition 14.21 where the roles may switch across subgames. 
For this reason such equilibria are also not easy to aggregate to a subgame-perfect equilibrium: 
for each subgame starting at some id £ FT, an event C £ ^ TL {d) h as to be agreed on that 
determines the respective roles. 

Maybe even more importantly, no player can obtain the preferred follower payoff by taking 
or threatening to take a certain action, but only by the threat of taking no action for a longer 
time, which has to induce the opponent to stop. Effectively, players compete in the credibility 
to take no action. Such problems can be avoided by allowing for mixed strategies, making the 
players indifferent about the roles when stopping occurs. This is our topic in the following. 

5 Equilibria in mixed strategies 

The very universal principle of the Snell envelope allows us to construct equilibria in mixed 
strategies in our general setting as well. But we do not only get existence: our equilibrium 
strategies can be clearly interpreted like the Snell envelope itself. Recall that the compensator 
relates to the expected loss from stopping too late. 

The logic of the following equilibria in symmetric games is this. If F > L but without 
the other conditions of Lemma EH waiting for future optimal times to stop L may not 
be a dominant strategy. Nevertheless, the players have an incentive and the possibility to 
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coordinate on not stopping too early, which can be extended until the latest optimal time 
r L fd) to stop L. To cross that point, however, any player has to be compensated by some 
chance to become follower with F > L, because otherwise any delay would definitely be costly. 
Of course the opponent has to be willing to provide that chance, so we identify the appropriate 
rate to compensate exactly the impending loss dDi > 0, to make both indifferent. 

This principle does not work where L > F, however, when there would be much more 
intense stopping due to a preemption incentive (see Section [TJ) . On the other hand, even if we 
were considering only games with a global second-mover advantage, there may be equilibria 
with even higher (symmetric) payoffs - if simultaneous stopping is feasible and sufficiently 
profitable at some future point, precisely where M > F > L. For these reasons we need to 
adapt the appropriate payoff process that players can coordinate on. 


Theorem 5.1. Consider a subgame beginning at d E FT with F$ > L$ a.s. on {■$ < oo}, 
and another stopping time r i} E FT taking values in [i?, inf{f > d \ F t < L t }\ a.s. Define the 

~ ft ft 

payoff process L T := l t<r &L + l t>T & rnax(F r o, M r &) and as the compensator of its Snell 
envelope. 

Then there exists a payoff-symmetric equilibrium with mixed strategies satisfying 


and 


G? (0 = 1 - 1 t<F exp 


[* dDf(s) \ 

Jti F s — L s ) 


(*) = !- h<C exp 


rt 1 Fs > Ls dDf(s) \ 

$ F s L s J 


(5.1) 


(5.2) 


i, j E {1,2}, i j, iff AGf(M-F) < 0 atinift £ M+ | Gf(t) = 1} < andAGf(M-F) > 0 
at t® < oo a.s. 

There further exists a symmetric equilibrium with both players using the strategy <E3H iff 
AGf{M -F) = 0 a.s. at inf{t E M+ | Gf(t) = 1} < t*. 


Proof: In Appendix lB.il 

The “endpoint condition” at inf{f E R + | Gf(t) = 1} =: 1) seems contradictory, but 

it plays the following role. First suppose AGffr 0 ) > 0, so r 4 ' (1) = t & and there is a joint 
terminal jump. This means that the players coordinate on the terminal payoff M r o, which is 
only feasible if M r <> > F r d (recall M^ = Foo by convention). 

Gf can indeed jump to 1 before r , where F = L. There we choose G^j continuous to 
address the case F = L > M , where payoffs are hence symmetric. This choice can only be an 
equilibrium, however, if indeed F > M. Otherwise j could obtain a higher value by stopping 
at t,- (1) and not supporting the equilibrium earlier on; we could then simply adjust to 

ensure the correct continuation values. 
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Finally, Gf can reach 1 also continuously (then Gf = Gf). This case is one reason why 
we write “ma x(F T 0 ,M T #)” in the definition of L T ^, although we require A Gf(M — F) > 0 at 
t' 1 . Even if A Gf = 0, the terminal value of L rl determines the continuation values on which 
players coordinate earlier on. Putting M r # regardless would not be correct. Another reason 
is that we will indeed obtain continuation equilibria with payoff max(i ? r ^, M T &) when we take 
up extended mixed strategies in Section [3 

The strategies here are continuous except for terminal jumps. As motivated above and 
given the appropriate payoff process L T , the opponent’s stopping rate RD ’~ /(F — L) makes 
each player indifferent when it would seem optimal to secure L. beforehand. That probability 
to obtain F. > L. exactly compensates any expected loss from forgoing L.. The resulting 
equilibrium payoffs are 


v? = vi 


ess sup E 

i)<Te^ 


1 t<t #L t + l r > r tf ma x(F t 0 ,M t #) 



m (*). 


The proof of Theorem 15.11 is based on martingale arguments. An important aspect is to 
take care of the different kinds of jumps in the strategies and to ensure that the underlying 
payoff process L T has the necessary properties (e.g., that _D£ is continuous). Where stopping 
happens continuously, it need not have a rate with respect to time dt, however (though it does 
in the explicit Brownian example in Section [U]); it might only take place on a set of time points 
of measure 0@ 

The strategies are trivial in the present equilibria - given that the endpoint is feasible - 
if either L or F is a (sub-)martingale on [d, r’^]; then there is no loss from waiting and 

dD? = 00 

Remark 5.2. It may happen that D ^ and hence Gf has jumps if L is only upper-semi- 
continuous from the right (and the left). In the specified equilibria, waiting is always at least 
as good as obtaining L and there must be indifference at increases of D £. This is of course 
not possible with a joint mass point where L > M0 Theorem 15.11 remains true if instead 
L = M (e.g., in an attrition model); see Remark lB.il 

The equilibria of Theorem 15.11 can deal so far only with subgames satisfying F$ > L$. 
Therefore, if we want to aggregate them to a subgame-perfect equilibrium, we have to assume 

29 In t his case Gf wo uld be a singular measure, which often appear in optimal control of Brownian models. 
30 Cf. iRiedel and Steel d2014h . Theorem 3.3, for this case, but also their Section 4.3 concerning issues in 
asymmetric games. 

31 Example'. No symmetric payoff equilibrium if L (> M ) not right-continuous. 


F -- 



Waiting is strictly optimal for i at t £ (Ti,T 2 ) if GjiT^—) > Gj(t), hence 
Gi(T 2 ~) = Gi(T\) and Gj(T 2 —) = Gj(T\) by payoff symmetry, giving a 
continuation payoff LfTb) on (Ti,T 2 ]. The only symmetric continuation payoff 
at Ti is then in (Lffi), L(T\)) from AGi(Ti) = AGRTi) € (0,1). Waiting is 
also strictly dominant on [0,Ti), but stopping short of Ti now yields a higher 

L — 

M ^ 




T\ 

t 2 

payoff than stopping at Ti. 
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F > L for now. Then existence is trivial by setting r" = inf {f > i? | Mt > Ft} for all d £ tY. 
If also F > M throughout, we get the further simplification that we can set t' 1 = oo and 
hence L T = L, D T f = Dl- In the latter case the stopping rates do not depend on d, which 
ensures time consistency. 

In general, however, we do not preclude the possibility that at some r* 5 , max(T, M ) > L or 
rna x(F, M) < Ul, such that D T ^ Dl on [i?, r^]. Then time consistency requires that t' 1 will 
not be changed if it has not been passed yet: for any two subgames £ 5 we should have 
t® = t®' on {i? < d' < r^} and vice versa (in summary, = t®' on {(d V d') < (r^ A r'^)}, 
noting that d < T r> and d' < t 0 ' a.s.). 

We now have the following payoff-symmetric subgame-perfect equilibria with “standard” 
mixed strategies for games with systematic second-mover advantage. 

Theorem 5.3. Assume F > L and fix i,j £ {1,2}, i j. If we have an equilibrium as in 
Theorem \ 5. 1\ for any d, d' £ FF with t & = t 0 ' a.s. on {(i? V d') < ( t° A t^)} - as we do by 
setting r ^ = inf{£ > d \ M t > F t } \/d £ 3F, e.g. - then the strategies and 

form indeed a subgame-perfect equilibrium. 

Proof: In Appendix lB.il 

Even if the time-consistency condition for the family {r^: d £ IP} postulated in the 

$ 

theorem statement holds, we then have a family {TE : d £ £1} that needs to induce time- 
consistent stopping rates (dGf)# e y. This is the main point of Theorem 15.31 given optimality 
by Theorem 15.11 

6 Example: Exit from a duopoly 

In this section we illustrate the simplification we get with games having a systematic second- 
mover advantage, which we pointed out in the context of Theorem 15.11 Specifically, we 
determine subgame-perfect equilibrium strategies by explicitly deriving the Snell envelope Uj J 
and its compensator Dl for a version of the market exit game in Example 12.31 The stopping 
rate during attrition is then represented in terms of a sustained flow of losses from unprofitable 
operations. 

To specify the model, assume that at each time t, discounted duopoly profits are given by 

irf = e- rt (Y t -c), 

where c > 0 is a constant operating cost and revenues (!})*>o follow a geometric Brownian 
motion solving dY = fiY dt + oY dB. If either firm becomes monopolist, the profit stream 
changes to 

vr f A/ = e~ r \mY t - c), 
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where m > 1. Each firm can decide to leave the market with accumulated payoff Lt = Mt = 
/q e~ rs (Y s — c ) ds, for example if Y gets so low that the revenue does not cover the production 
costs. In such a phase the game is a war of attrition if monopoly seems profitable. However, 
it may also be optimal to stop immediately in the follower’s problem 


Ft = L t + ess sup E 

t<T F &£f 


e rs (mY s — c) ds 




The latter problem is a standard exercise under the condition r > max(/r, 0)o and its unique 
solution is to stop as soon as Y falls below the threshold 


/?2 r - [i c 

ym — 75 i 

p 2 — 1 r m 



where is the negative root of the quadratic equation |<j 2 f3(/3 — 1) + p/3 — r = 0. The value 
of the stopping problem can be explicitly expressed as 


F t -L t = e~ rt l Yt>y , 


rnYf 
r — p 


c 

r 


XlY 2 ( mVm - -Y 

Vm) \r-p r)\ ’ 


( 6 . 1 ) 


which shows that F is in fact a continuous process and F = L (= M ) Y < y m . Hence, 
for any equilibrium as in Theorem 15.11 we need t 1 ' > inf{i > 1 ? | Yj < y m } for the endpoint 
condition. On the other hand, stopping is strictly dominant for a monopolist as soon as 
Y < y m , and so it is in duopoly, where revenues can never exceed those in monopoly. Therefore 
we can choose = inf{t > d \ Yt < y m } without loss and it will lead to a symmetric 
equilibrium at any d € S? as follows. 

Since F is a super martingale for m > 1 and dominates L, it also dominates the Snell 

~ $ 

envelope Ul of the latter, such that we have F = L =>■ F = Ul = L. Consequently, L T 
in Theorem I O here is just L stopped at r^, and the Snell envelope U~^ coincides with Ul 
until t°. Applying the RHS of (16.11) with m = 1 yields the solution to optimally stopping the 
leader (duopoly) payoff: 

U L (t) = ess supE[L r | & t \ = L t + e~ rt l Yt > yi 

t<T€5f 



Applying Ito’s lemma shows that the monotone part of the supermartingale Ul is just the 
drift 

dD l — — 1 }\<y 1 (c 1 /) dt 

where immediately stopping L is optimal. With r” = inf{t > d \ Yt < y m } for every d £ 3?, 

32 This is also necessary and sufficient for the processes to be of class (D) in accordance with Assumption 
12.11 Then —c/r < Lt < j Q °° e~ ra |Y S — c| ds G L 1 /P ) and similarly for T, inserting m. 
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dD £ = dDL and (1(1.11) we now have a fully explicit symmetric subgame-perfect equilibrium, 
with payoffs Vf(Gf. G'-) = Ul (i?), respectively. 

As y m < y i < c, we see that dDL is simply the stream of losses resulting from unprofitable 
operations. If a duopolist never hoped to become monopolist, these losses would be too large 
to keep operating. Here, whenever Y £ {y m ,y 1 ), both firms are leaving duopoly at a rate that 
depends directly on those running losses; it is decreasing in Y. The state may rise next to 
the region ( yi,c ). Then there are still running losses, but the firms suspend mixing because 
the option to wait for a market recovery is sufficiently valuable. Thus there is no need for 
a compensation. There will typically be alternating periods of continuous and no mixing. If 
the state drops to [0, y m \, however, the option to wait for market recovery would be worthless 
in the face of running losses even if a firm was (sure to become) monopolist, and both firms 
quit immediately. 


7 Equilibria for general symmetric games 

7.1 Preemption with extended mixed strategies 

In a preemption situation, i.e., when there is a first-mover advantage L > F, ther e typ ically 
exist no equilibria in pure s trategies in continuous time. iFudenberg and Tirolel (|1985T ) and 
Hendricks and Wilson! ([1992) show that this issue arises when there is an incentive to wait (L 
is increasing). If the model is sufficiently regular and the first-mover advantage is strict, then 
one may have equilibria in (standard) mixed strategies, with one player stopping immediately 
and the other stopping at a sufficient rate, such that the first would not be able to realize the 
increase in L. The payoffs are then asymmetric, L and F. However, these equilibria cannot 
be extended to the boundary of the preemption region; if L = F, the necessary stopping rate 
to support any equilibrium explodes. This observation does not depend on any regularity 
conditions, the payoff processes can be arbitrarily smooth and deterministic. 

Therefore, if we want to allow for any equilibria where preemption will be set off (or 
also symmetric payoff equilibria where L > F), we need to enrich the strategies. The key 
is to facilitate some partial coordination when players try to stop at the same time, but 
when simultaneous stopping would be the worst outcome. Hence we make useof t he stra tegy 
extensions af from Definition 12.41 which have been introdu ced in lRiedel and Steel ( 20141 ) and 
which follow in spirit those of Fudenberg and Tirole (| 19851 ). With these extended strategies 
one can capture the continuous-time limits of symmetric, mixed discrete-time equilibria, which 
do not suffer from such problems 1^1 


33 In discrete time, there can be equilibria with a positive probability of simultaneous stopping even if that is 
the worst outcome, because the players can only assign positive probabilities to the single periods; one cannot 
circumvent coordination failure by stopping e after a mass point of the other. 
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We then obtain the following equilibria of immediate stopping for subgames with a first- 
mover advantage - here for a symmetric garneF 


Proposition 7.1 (Riedel and Steg (2014), Proposition 3.1). Fix d € FT and suppose d = 
inf{f > i? | L t > F t } a.s. Then (Gf,af), (Gf,af) defined by 


a i (t) 


1 


^-Lt>Ft 


Lt-F t 
L t - M t 


if Mt > Ft and t = infjri > t \ L u > F u }, 
else 


for any t € [$, oo) and Gf = 1 1 >$, i = 1, 2, are an equilibrium in the subgame at d. 
The resulting payoffs are Vf (Gf, af, Cj , a"j) = max(Ftf, M$). 


Remark 7.2. Where L > F > M, the choice of af makes the respective other player indif¬ 
ferent between stopping and waiting, and ocf(-) is right-continuous, allowing a limit outcome 
argument. Where M > F, stopping is of course the unique best reply. In the polar case 
L t = F t = M t , there might not be a right-hand limit of 1 L t >F t tf-M t ’ so we se ^ a f(^) = 1- If 
the limit does exist, one can use it to make af(-) right-continuous even here, since the players 
will be completely indifferent in this case. 


If Lfl = Ftf > M ,/), e ach p laye r becomes le a der or follower with probability This 


is 


the same outcome as in 


Fudenberg and Tirold ()1985l l for their smooth, deterministic model. 


If L# > Ftf > M$, however, there is a positive probability of simultaneous stopping, which is 
the price of preemption, driving the payoffs down to F$. 


7.2 General symmetric equilibria 

We can now combine the equilibria we obtained for F > L and L > F, respectively. With 
standard mixed strategies, the equilibria for a current second-mover advantage of Theorem 
15.11 depend on the “endpoint condition” A Gf(M — F) > 0, e.g., where a preemption regime 
begins with both players trying to stop immediately. Proposition 17.11 however, gives us 
“continuation” equilibria of immediate stopping at such transitions with payoffs max(I ? ., M.). 
Indeed, if player j uses an extended mixed strategy, the payoff difference for player i between 
stopping and waiting where G^ jumps to 1 at fj = inf{f > $ | a.j(t) > 0} changes from 
AGj (M - F) to 

AGj(aijM + (1 - oij)L - F), 

which is nonnegative if a| is as in Proposition 17.11 Therefore, this possibility to coordinate 
partially in preemption also generates suitable endpoints for attrition regimes where we cannot 
have M. > F. before reaching {L > F}. 

34 The extension to {M > F} is straightforward in the symmetric case. 

35 Then the liminf and limsup in Definition 1C.II are both | with the strategies of Proposition 17.II 
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Now, for any symmetric stopping game - where the payoff processes L , F and M do not 
depend on the individual players - there exists a payoff-symmetric subgame-perfect equilib¬ 
rium: 


Theorem 7.3. Under Assumvtion \2.1\ there exists a pay off-symmetric subgame-perfect equi¬ 
librium in extended mixed strategies (G i,ai), (G^a^) given as follows: 

Pick i,j £ {1,2}, i ^ j. For any id £ 2F, set r^ := inf{t > id\ L t > F t or M t > F t }. 
Define Gf, Gf as in Theorem, 1,5.71 and af = af as in Proposition 1 7. /I 

Further, if the payoff processes are such that for any stopping time t £ FT with L r = F r , 
either F T = M T or t = inffi > r | L t > F t } a.s., then there is a symmetric subgame-perfect 
equilibrium using Gf from Theorem \5.1\ for both i = 1,2 and any id £ FT. 


Proof: In Appendix lB.il 

The idea of these equilibria is virtually pasting the war of attrition that we have on 
{F > L} using the continuous strategies by Theorem 15.11 with the preemption equilibria of 
immediate stopping on {L > F} by extended mixed strategies as in Proposition 17.11 

By the upper-semi-continuity of Assumption 12.l[(iii) it is during attrition feasible for the 
players to coordinate on a future continuation equilibrium with payoffs max (A., Mi). Then 
there will be no predictable drop in payoffs from setting off preemption. The corresponding 
equilibrium payoffs are 


V? = V? 


ess sup E 


1 t<t #L t + l r>T tf ma x(F T 0,M T ti) 



with = inf{t > id | L t > F t or M t > F t } for any dd £ FT. 

While the endpoint condition A Gf (. M — F) > 0 at is now replaced by the preemption 
continuation equilibria, we still need to ensure the second one, A Gf(M — F) < 0 at 1); 
the cap A inf{t > id \ Mf > Ft} works generally, but there may also be alternative choices 
in more specific cases. The proof of Theorem 17.31 relies of course on those of Theorem 15.11 
and Proposition 17.11 The main issue is that the former was formulated in a reduced setting 
with “standard” mixed strategies, so we establish a formal relation to the present setting with 
extended mixed strategies. 


8 Efficient symmetric equilibrium 

The equilibria of Theorem 17.31 involve maximal preemption - wherever L > F - and thus 
have a relatively simple structure: the game ends as soon as there is a strict first-mover 
advantage. Preemption need not be that severe if there are future continuation equilibria 
with sufficiently high (expected) payoffs. In this section we identify equilibria with least 
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possible preemption, entailing the highest attainable equilibrium payoffs. We focus on the 
class of pay off-symmetric equilibria , which are the subgame-perfect equilibria with Vf = Vf 
a.s. at any stopping time i9 € FT. These have strong implications for equilibrium strategies. 
Then, in competitive games, where M is throughout the lowest payoff, equilibrium payoffs are 
at most what can be obtained from optimally stopping min(F, F) - no matter how players mix, 
possibly using public correlation (Proposition 18.ID . This bound on equilibrium payoffs enables 
us then to identify inevitable preemption points: those where the leader payoff L exceeds 
any continuation equilibrium payoff. Theorem 18.21 formulates a corresponding algorithm and 
establishes the existence of “efficient” subgame-perfect equilibria. 

It is quite clear that any stopping on {T 1 > L} must induce the lower payoff L if M is 
generally the worst. The basis of our argument is the more subtle result that players also 
cannot exploit L > F by mixing in any payoff-symmetric equilibrium, even if they have no 
time constraint. 

Proposition 8.1. Suppose M < min (L,F). Then, in any pay off-symmetric equilibrium and 
for any ft <E ST, i, j <E {1,2}, i f j, 

Vf(Gf, of, Gf , af) < U LAF ($) = ess sup E [L t A F r | , 

where we can in fact restrict ourselves to stopping times r < inf{t > d \ Gf V Gf > 1}. 

The proof of Proposition 18.II in Appendix lB.2l is based on the following important facts for 
any payoff-symmetric equilibrium (which do not depend on the assumption M < min(F, F), 
yet): First, the conditional stopping probabilities of the players must be the same on {F L} 
(Lemma IB.2D , since a player who stops with a higher conditional probability also becomes 
leader with a higher conditional probability, whereas the other becomes follower on that event. 
As one consequence, Gf and Gf must then even be identical before they put any mass on 
{F = L} (Lemma IB.3D . As another consequence, on {F L} we can only have simultaneous 
jumps. These are only possible if M. > F, or if preemption occurs with L. > F. > M.. Most 
importantly, we cannot have any jumps where F > max(F, M ). Finally, the local payoff from 
any terminal jump is bounded by max(F, M ) (Lemma IB.4D . 

The intuition for Proposition 18.11 is now the following. In any equilibrium, player i must 
be willing to wait until any point at which Gf < 1, and stop only from there on with the 
corresponding conditional probabilities. Consider as such a point the first time at which any 
player puts some mass on {F > L}; call it f. By waiting until f, player i might become 
follower where Gf increases earlier on {F < L}. At f, by definition at least one player is 
willing to stop. The corresponding (symmetric) local payoff is clearly Lf < Ff if Gf, Gf are 
continuous; a jump can only occur if indeed Ff = Lf, which is also the maximal local payoff 
(with the hypothesis M < min(F,F), we cannot have any jump where Ff > Lf as we have 
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seen). Finally, it might happen that Gf is exhausted on {T < L}, before ever reaching f. 
Then, however, we must have Gf = Gf- If they jump to 1, the terminal payoff is at most 
F < L; if they attain 1 continuously, this means in the limit becoming follower for sure on 
{F < L}. In summary, player i never receives more than min(L, F) where stopping occurs. 

Proposition 18.11 implies that whenever L$ > Ul/\f($), we must have Gf('d) V Gf^d) = 1 
by preemption l 36 l If there are any such preemption points in the future, they also restrict 
the feasible stopping times r to maximize the expected value of min(L, F) in Proposition 18. 11 
which even further reduces the maximally attainable equilibrium payoff. By iteration we can 
identify where preemption is inevitable. 

Theorem 8.2. Suppose M < min(L, F). Then there exists a maximal pay off-symmetric 
equilibrium with value 

V? = V? = ess sup E[L t A F t \&#\ := U, LAF)fm {d) 

tG[ #,f (ft)] 

for any d € TT, where f(d) is the latest sustainable preemption point after d determined by 
the following algorithm: 

(i) Set to(i9) := inf{f > d | L > Ulaf } and 

(L A := (L tAToW A F) AtoW )^ 0 

with Snell envelope U( LAF y 0 W = (esssup f<rG ^ E[(L A F)f oty0] \^t]) t>0 - 

(ii) For all n G N set r n (d) := inf{t > d \ L > U^ LAF y n _ 1 (-§)} A r n _i(i?) and 

(L A F) T "W (L tATnW A F tATnW ) t > 0 

with Snell envelope U^ LAF y n (^) = (ess sup i<rG ^ E\(L A F)f n ^' > \ ^t]) t>0 - 

(iii) Take the monotone limit f(d) := lim n _ > , 00 r n ($). 

Proof: In Appendix IB. 31 

The payoff-maximal equilibrium is implemented using the strategies of Theorem 17.31 but 
setting af = 0 before f(d). Constructing f(d) by the algorithm is technically not difficult. 
The main problem is rather to verify the claimed equilibrium properties: to make sure that 
there is no preemption incentive where L > F on [$, f (t?)), that f(d) is indeed maximal, and 
that there is a continuation equilibrium of preemption at f(d). Further, measurability is a 

36 This argument is not impaired by any jump AG^(tl) £ (0,1) due to which player i could not realize L$. 
L is right-continuous, so player i could try to stop right after ft. The formal argument is given in the proof of 
Theorem 18.21 
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major technical issue, since we want to have a time-consistent version of the strategies where 
we set ctf = 0 on [i?, t(i9)) for all i? £ &, to achieve the maximal payoff in all subgames. 

In order to suppress preemption where L$ > F$, it is obviously not sufficient that there 
exists r > i? such that E[L r A F r \ this relation then rather has to hold on all of 

ft?, t] n {L > F}. For instance, the algorithm of Theorem 18.21 can be applied to the model of 


Fudenberg and Tirolel (J1985) by visual inspection, shown in Figure [TJ 



Figure 1: Preemption, Fudenberg and Tirole (1985) 


L exceeds the future maximum of min(L, F) for the first time at tq, whence there will be 
preemption. Taking that into account, at most the maximum of min (L,F) up to To might be 
achieved. However, L will also exceed this reduced value, at ti. In the limit, t(0) = T\ is 
the first inevitable preemption point. Fudenberg and Tirole also consider another, Case B, in 
which the peak at T 2 is higher than first one. Then t(0) = 00 , because L = F at and from 
their global peak onwards, and the player can coordinate on joint late adoption. In general we 
may have much more complex stochastic patterns, of course, with arbitrary regions of first- 
and second-mover advantages, that may trigger preemption or not. 


9 Conclusion 

In many timing games mixed strategies play an important role as we have argued, either for 
equilibrium existence or to resolve any strategic conflicts (about roles with differing amenities) 
within the game. Having analysed the two different kinds of local strategic incentives, we have 
been able to prove existence of and to characterize quite explicitly subgame-perfect equilibria 
for general symmetric stochastic timing games, providing symmetric equilibrium payoffs. Our 
approach is based on the general theory of optimal stopping and demonstrates which kinds of 
stopping problems need to be solved to verify equilibria, not only but in particular for mixed 
strategies. 

There are possibly different equilibria for a given timing game, with varying degrees of 
preemption. We have considered the two extreme cases: If one initiates preemption whenever 
there is any first-mover advantage, payoffs may be severely restricted. However, we have shown 
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how to reduce preemption to a minimum and proved existence of corresponding equilibria with 
maximally attainable payoffs. If preemption can indeed be prevented in a certain regime with 
first-mover advantage (by sufficiently profitable future continuation equilibria), then there 
may also exist further equilibria with continuous mixing, which we have only employed at 
second-mover advantages. Nevertheless, any such additional mixing will be inefficient and 
induce lower payoffs (which one can also show directly). 

A more specific strate gic i nvestment model with random first- and second-mover advan¬ 
tages is analysed in lSteg and Thijssen (2015), where the strategies corresponding to the ones 
derived here have Markovian representations. 


A Technical results 


Lemma A.l. A measurable process X = (Xf) te m> is of class (D) iff the set {X T : t a stopping 
time } is uniformly integrable for any given X^ € L 1 (P). 

Proof. We only need to show necessity: Let X be of class (D) and fix arbitrary X^ G 
L l (P) and let 3F denote the set of all stopping times. Then, for any r £ PA and n € N, 
|X T An| 1 t<oo < |X rAn |. Hence the set {|X rAn | l r<0 o: r € IX, n G N} U {\X T \ : r < oo 6 IX} is 
uniformly integrable as well. As we may also include limits in probability of its elements, and 
\X T \ l r <oo = hnin^oo |A rAn | l T <oo a.s. for any r 6 J', we observe that {|X r | l r<0 o: t G IX] 
is uniformly integrable. 

With Xqo € L l (P), also {| ATqo | l T=0 o: t G IX} is uniformly integrable. Now let | > 0. By 
uniform integrability there exists a 6 > 0 such that max{i?[|A T | 1 t<0 o1a], A[| Aqo 1 t=00 1a]} < 
| for any measurable A with P(A) < 5 and any r G IX. Hence we have A[(|X r | l r <oo + 
I-XjjoI 1 t=00 )1a] < e, which shows that {|A T | l T<0 o + |Ax>| 1 r =oo: t € IT} is uniformly inte¬ 
grable as claimed. □ 

Lemma A.2. If L is a (measurable) process of class (D) then there exists a constant K G 
such that for any process G that is a.s. right-continuous, non-decreasing, non-negative and 
bounded by some G 0 0 G L°°(P) and all random variables 0 < a < b < oo a.s. we have 


(i) 


E 


[a,b) 


\L t \ dG t 


< A||Goo||oo < oo and 


P P OO 

/ dGt / | ^r G (a:) I (x)&[a,b) ^X ^ OO a.S ., 

J\a.b) JO 


[a,b) 

where t g (x) := infjt > 0 | Gt > x}, x G M+, and AGo = Go; equivalently, “Gt > x” in 
t g (x). 

If{\Lr\ lr<oo • x G IX 1 is bounded in L°° (H) by A G 1R_|_ and G bounded by some Goo G 
L l (P), (i) holds with KE[Goc,\ instead and (ii) as stated. 
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is the left-continuous 


Proof. The (a.s.) non-decreasing family of stopping times (r G (x)) xeR 
inverse of G, which satisfies 

t g {x) <t Gt > x. 

Thus, with the convention Jj 0c j dG = G c , we have Jj 0 ^ lyi dG = J () °° dx for all 

A £ {[0, c]: c € M + } and hence for A = M + by monotone convergence, a.s. By a monotone 
class argument^ the relation holds on all of ^(R+) a.s. 

Since L.(oj) : M+ —>■ M, 1 1 —>• is Borel measurable^ like the function lie[a.(oj),6(c l ;)) ? we 

now obtain the following change-of-variable formula^: 



L t | dG t 


' {r G (a:)<oo} 


L 


t g (x) | *-t g ( x)£[a,b) 


dx 


a.s. 


As inf{f > 0 | Gt > x\ = r G (x+), which differs from r G (x ) only on a set of Lebesgue measure 
(dx) 0, we can equivalently use the former. By Fubini’s Theorem 


' f 00 . 1 [W G ° 

J I ^~t g ( x)£[a,b) dx T J 


E 


L. 


r G (x) | *-t g (x)<oo 


dx. 


(A.l) 


As L is of class (D), {\L r \ l r<0 o: r £ is bounded in T 1 (P) by some K < oo, whence the 
RHS of (I A. ID is bounded by ATHGooHoo if the latter is finite. If sup Tei y ||L T 1 T<00 || 00 < K and 
G bounded by G Q0 £ L 1 (P), then the RHS of (1A.1I) is bounded by 


E 


KG 


G (x)<oo 


dx = E 


KG 


Go 


G (x)<oo 


dx 


< KE[Goo} < oo. 


In either case it follows that f^ ab ^ \L t \ dGt < oo a.s. □ 

Lemma A.3. Suppose the processes L, F, and M are optional and of class (D) and the 
adapted process G is right-continuous, non-decreasing and taking values in [0,1] a.s. Then S 
defined by 

S u '■= f F s dG s + AG U M U + (l — G U )L U , u £ M_|_, 

J [0,u) 

is optional and of class (D) as well. 


Proof. The components of S are obviously optional, in particular the integral being a left- 
continuous and AG the difference of a right-continuous and a left-continuous adapted process. 
We have to show that ('S'( r )) r<00 as is uniformly integrable and obviously only need to con¬ 
sider the first, integral component of S. By the Theorem of de la Vallee-Poussin it is necessary 


37 See, e.g., 
38 See, e.g., 
39 See, e.g., 


Jvallenberg 

Kallenbere 


2002), Theorem 1.1. 
(20021), Lemma 1.26 (i). 


Kallenbergl 20021) . Lemma 1.22. One needs to restrict dx to {r G (x) < oo}, which is redundant 


whenever we have [a, b). 
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and sufficient for a process X to be of class (D) that there exists a non-decreasing and convex 
function g: —> M + such that lim^oo = oo and sup T<00 £/[g(|X r |)] < oo. 

Borrowing the corresponding function g that there is for F and by a change of variable as 
in Lemma lA.21 we obtain 


sup E 

T <00 


< sup E 

T<00 


F, dG f 


'[0,r) 


9{\F T G( X )\) dx 


< sup E 

T <00 


E 


9yJ 0 \d^T G (x)\^~T G (x)<T dx 

g(\F T G( x )\) dx < oo. 


< SU P<7<00 E [s(|At|)]<00 


We used Jensen’s inequality for the last estimate. 


□ 


Lemma A.4. Consider two stopping times a < r and an event C G ■T r „. Then 


d := ale + t! c c 


is a stopping time. If the filtration is complete, it suffices that a < r a.s. 

Proof. To verify that d is a stopping time, we check whether {$ < u} G for all u G M+. 

First note that 

{d <u} = ({a < u} PI C) U ({t < ri} PI C c ). 

The first intersection belongs to TF U by definition of Then also its complement ({a < 
u} fl C) c = {<j>ii}UC c G .F u and thus C c fl {a < u} G ,’F U - Finally, as {t < u} G we 
conclude 

C c PI {a < u} fl {r < u} = C c fl {r < u] G ,'F U ■ 

If a < r only a.s., the last equality holds up to the nullset C c Fl {a > u} fl {r < u}, which is 
contained in if the filtration is complete. □ 

Lemma A.5. The process L T := 1 t<T $L + 1 (>T « max(F T ^,M T o) defined in Theorem I5.il 
is upper-semi-continuous from the left in expectation on [$, oo], where D^ is hence left- 
continuous a.s. 

Proof. Let (r n ) be a sequence of stopping times that is a.s. increasing and taking values in 
[#, oo], and denote the limit by r G 3T. Define the measurable set 

A := H{r„ < A 

n 

Then lim ra L Tn = L T a.s. on A c , implying lim n E[lA c L Tn ] = E[1 a c L t \ since L is of class (D). 

We obtain the analogue for L AF if we use (r n At’ 3 ) and (rAr), respectively. Combining 
the latter fact with upper-semi-continuity from the left in expectation given by Assumption 
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I2.l|(iii) yields 


lim sup .E [1^(1/ A F) TnAT & + 1 a c {L A F) TnAT #] < E[1a(L A F) tAt # + 1 a c {L A F) rAr ^] 

n 

=$■ limsupE[lA(L A F) Tn ] < E[1a(L A F) r \. 

n 

Note that (L A F) Tn = L Tn on A and (L A F) T < L r on A C {r < t '' 1 }, implying the first 
claim. 

Finally we show that if L is upper-semi-continuous from the left in expectation on an 
interval [i?, oo], then the paths of are left-continuous on that interval, a.s. Define as an 
auxiliary process L := l t< $M+l t >^L, where M is a right-continuous version of the martingale 
(E[Ltf\JFt])t>o, which is uniformly integrable and thus (left-) continuous in expectation thanks 
to optional sampling. L inherits upper-semi-continuity from the left in expectation because 
E[L Tn ] = E[l Tn< #L# + 1 Tn >$L Tn ] = E[L TnV $] and similarly E[L T \ = E[L tV #\. Since L and L 
agree on [i?, oo], their Snell envelopes and U A agree at any stopping time in that interval 
and thus Ufflt>$ and U L lt>$ are indistinguishable by the uniqueness of optional projections. 
The same holds for the compensators Dj^ and D^ on [$, oo] by uniqueness of the Doob-Meyer- 
decomposition. D^ is left-continuous a.s. since L is upper-semi-continuous from the left in 
expectation, see footnote [Ml □ 

Lemma A.6. Let A, B: ->1U {+oo} be two right-continuous, nondecreasing functions, 

0 < A B < 1. Then the differential equation 


dG = (1 - G) dA, G 0 - = a £ R (A.2) 

has the solution 

G = 1 - (1 - a)e-f dAC n(l + AA)" 1 = 1 - (1 - a)e~f ^ c -£in(i+AA) ; 
where A c = A — A A £ [Ao, A] is the continuous part of A, and the differential equation 

dG = (l-G-)dB, G 0 -=b£R (A.3) 


has the solution 

G= 1 — (1 - b)e~S dBC n(l - A B) = 1 - (1 - b) e ~I dBC+ ^ Hl ~ AB) . 


Any solution to 4MD or tin is monotone towards, but never crossing, 1. 


equations iff AA = 1 AB B , resp. A B = 


A A 

l+AA- 


G solves both 


Proof. Straightforward to check. Note that dGo 


(1—a) dAo 
l+AA 0 


for any solution to (IA.2D , implying 
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both monotonicity on either side of 1 and that the latter can never be crossed. Indeed, 
Go ^ 1 <£> AG 0 ^ 1 — G 0 - = 1 - a 0 ^ 1 - a. 

Similarly dGo = (1—6) d£>o for any solution to flA.il ll . implying Go ^ 1 AGo ^ l-Go_ = 
1 — 6<t^0^1 — b and ABq < 1, where ABq G [0,1] is used for the last equivalence. □ 


Lemma A.7. Let (Y t ,Z t ) te r 0 u be a family of random variables. Assume that the family ( Y t ) 
is uniformly integrable and that ( Z t ) is bounded in L°°(P) and Z t —> 0 in probability as t —> 1. 
Then 


lim E[Y t Z t \ = 0. 


Proof. (||Z t ||oo) is bounded by a constant K, hence (Zt) is uniformly integrable and converges 
to the constant 0 also in L l (P) as t —> 1. Since (Yt) is uniformly integrable, we can find for 
any e > 0 a suitable constant K e > 0 such that 


E[l\ Yt \>K e \Y t Z t \] < eK for all t <5 [0,1]. 


In combination, 

lim sup-E[|YtZt|] = limsup{£i[l|y t | >ife \Y t Z t \] + E[l\ Y \ <K \Y t Z t \]} 

t-S-l i—5^1 L } 

< limsup£f[ljy t | >x E \Y t Z t \\ +\imE\l\ Yt \ <Ke \Y t Z t \] < eK 

t-u 

and the claim follows. □ 

B Proofs 

B.l Proofs for results in Sections [HHT] 

Proof of Lemma 13.11 For Gf feasible we can define the right-continuous inverse 

T f’ d (x) := infjs > id | Gf (s) > x}, x G M + . 

As in Lemma lA.21 it leads to the change-of-variable formula 

f Sf(s)dGf(s)= f Sf(T^’ d (x))l G ,t> { ) dx a.s. 

J[a, oo) Jo * 

Further we have x > Gf( oo— ) => rf'^(x) = oo =4- x > Gf( oo— ), i.e., 1 x> qv (oo-) A 
1 r G,* (a .j =oo < l x > G ?(oo-) for a11 ieR + a - s -> implying 

AGf(oo)Sf(oo) = ^ l T G,o (a , ) _ oo dx'jSf (oo) a.s. 
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Thus 


V?(Gf,G»)= [ E[sf(T?’*(x))l r 9 ,o {x) 


)<oo 




dx + / E 
Jo 


= / E 
Jo 


St{T?’\x)) 




dx 


Sf(o o)1 t g,# ( x)=00 |^] dx 

(B.l) 


< f ess sup E [Sf (r) | # 0 ] dx = ess sup E [S'f (r) | . 


□ 


Proof of Lemma 14.11 By right-continuity of L, we have Sf(r+) — Sf(r) = A Gj(r)(F T — 
M r ) > 0 for any r £ Now consider the set {r^ < r^(d)} on which we have, a.s., 


E 


'S?(T L (0)+)-SfW 




= E 

> E 

> E 


tl 


s dG%s) + (l - G?(t l (0)))l tlW - A G?(Ti)M n - (l - Gf (rO)i 
w (gJ(t l (0)) - G'j (Tj—)) + (l - G^(d)))L riW - (l - G?(Ti-))L Ti 




Ti 




&T. 


L 


'tlW 


(l-Gj(Ti-))-(l-Gt(ri-))i- 




> 0. 


The first inequality is obtained from a change of variable (demonstrated below) similar to that 
in the proof of Lemma I3.1L exploiting that E is a super martingale, and L > M. The second 
inequality is due to F > L and the last to the optimality of Note that the last one will 

be strict if P[r l < tl (d) and Gjfa—) < 1] > 0 by suboptimality of any d < r* < ri,(d). The 
second claimed estimate of the lemma follows from setting r, = d in the steps above. Further, 
the previous and following steps go through identically with Ti,{d) replaced by r L (d). 

The change of variable proceeds as follows: 


E 

= E 


»,Tl(0)] 
r 1 


F s dG»(s) 


Pr. 


F rf'°(x) Kf'* (*)e[ri ,r L (0)] 




= E F 


[ r rf^(x) L rf^(x)G[T U T L ^)] 




dx 


r -1 


> 


E 


I o 


E 


F n 


= E 


tl($) 


tl(0) 

r 

'o 


SF G ,# 


(®). 


rf’ # (*)e[Ti,ri,(j?)] 




dx = / E F. 


10 


G, i? 


(x)e[Ti,T£(l?)] 


dx 




= E 


K L «,)(Gf(ri.m)-G?(n-)) 


tlW r G ’’ , (x)e[r i ,r i (i?)] 




dx 


□ 


Proof of Proposition 14.21 Let d £ EE and C £ & TL ($y Then r*, r| as hypothesized are 
stopping times thanks to Lemma [A.41 as T£,(i?) < Tp(d) a.s. 

To verify the optimality of r*, it suffices by Lemmata 13.11 and Id. II to consider stopping 
Sf from t 'ij ('d) • On C, S'f(t) = L tATF ^ for all t > Ti,(d), such that stopping immediately at 
7x(i?) is optimal by its optimality for L. On C c , S'f(i) = F Tl <> M Tl ^ for all t > tl (d), 
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with equality on { tf (■#) = by hypothesis. Hence, Tp{d) is optimal on C c . The same 

argument applies to r|, swapping C and C c . 

We can use Tpijd) := inf{t > $ | F t = M t }, since it does not occur before tl (#), a.s. 
Indeed, as F is a super martingale dominating L, it also dominates the Snell envelope Ul- 
Therefore, at Tp('d), F = M (by right-continuity and F > M), implying Ul = L by L > M. 
Hence, tf($) > inf{i > d | Uj^t) = L t } = tl($). □ 


Proof of Theorem 15.11 L rl5 is right-continuous a.s. and of class (D), so it has a Snell en- 
velope Uf with an integrable and predictable compensator D - . We write for simplicity L 
and D £. The latter is continuous on [i?, oo] a.s. since L there is upper-semi-continuous from 
the left in expectation, see Lemma IA.5I and footnotes [Til [T71 

Now Gf is a feasible mixed strategy, as it is clearly adapted and a.s. right-continuous and 
non-decreasing, taking values Gf = 0 on [0, d) and Gf(oo) = 1. The only possible jump 
occurs at r 4 G,1? (l) := inf{f > •& \ Gf (t) = 1}. 

Gf as defined in (15.21) is even continuous up to r . 1 f > l/{F — L) can be understood 
as a Radon-Nikodym derivative, such that the integral defines a measure on M + , which is 
absolutely continuous with respect to the (finite) measure dD^ having no mass points@ 

To prove first that Gf is a best reply to Gf we will show in view of Lemma 15.11 and its 
proof that 



> E 


Sj ( T ) 



for all stopping times t > d, with equality whenever dGf > 0 (implying equality in (1B.1D L 
In fact, we establish the stronger condition 


E 


Sf(r°) - Sf (r) 




> 0 


(B.2) 


(with equality whenever dGf(r) > 0), where it suffices, however, to consider stopping times 
r < r^, since A Sf^) = AGf (t' 9 )(F t ^ — M r o) < 0 by hypothesis, and Sf is constant on 
(r- (l),oo]. 

To ease readability in the following demonstration of (IB. 21) . we simply write G* for Gf , Sj 
for Sf, a for r and b for t G,1? ( 1). By the other hypothesis A Gi{F — M) > 0 at b < r d , now 
Sj (r^) = J^ ob -)FdGi + AGj(6) max(F(,, Mb). Further, our Gi satisfies 

dGi(s) = (1 - Gi(s)) p 

for all s G [a, b) a.s., implying 

f {F s — L s ) dGi(s) = f (1 - Gi(s)) dD L (s), (B.3) 

J [a, b) d[a,b) 

40 The new measure is also cr-finite as {F > L} = (J N {F L > L}. 
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where / LdGi is well defined by Lemma lA.2l for L of class (D). We apply integration by parts 
to the RHS (adjusting for [a, b) closed on the left, open on the right, and recalling that D^ is 
continuous) to find 


f (1 -Gi{s))dD L {s)= [ D L ( 8 )dGi{s) (B.4) 

J [a,b) J [a,b) 

+ (1 - Gi(b-))D L (b) - (1 - Gi(a-))D L (a). 


With the help of Gi(b) = 1, (IB.31) and (IB. 411 we see that 


Sj(r d ) - Sj(a) = ( F s dGi(s) + A Gi(b) ma x(F b , M b ) - (l - Gi{a))L a - A Gi{a)M a 
J [a,b) 


[a,b) 


(L s + D l {s)) dGi(s) + (1 - Gi(b—)) (max(F fe , M b ) + D L (b )) 


— (l — Gi(a—)) (L a + D^(a)) + A Gi(a)(L a — M a ). 

Now, as the martingale component M^ of the Snell envelope is uniformly integrable, / M^ dGi 
is well defined by Lemma I A. 2 1 By the change of variable proposed there we find that 


E 


J[a,b) 

= fs 


Mi(s ) dGi(s) 




= / E 
Jo 


M L( T i( x )) 1 rP(x)e[a,b) 




E 


M~ L (b) 


^rf(x) 


’(x)£[a,b ) 




dx = E 


M L (b) 


dx 


r-P (a;)e [a,fe) 


dx 


^ n 


= E 


M z (6)(G,(6-)-G i (a-)) 


-s a 


= -E 


M L {b){l - Gi{b-)) J? a + M L (a){l - Gi(a-)) 


Combining the last two results yields 
E[Sj(T*) - Sj (a) | ^ a ] 

/ (L 3 + D l (s ) - M l (s)) dGi(s) + (1 - Gi(b-)) (ma x(F b , M b ) + D L (b) - M~ L (b )) 

.J\ a,b) 


= E 


- (1 - Gi(a-)) (L a + D L (a)~ M L (a )) + A G t (a)(L a - M a ) 




(B.5) 


It remains to observe that L + — M^ = L — U^ = 0 whenever dD^ > 0, i.e., when 

dGi > 0 on [«,&), which makes the integral vanish; cf. (13.611 . The same argument applies to 
the second term where A Gi{b) >0: if b < r , then the jump must result from dD^{b) > 0 
and F b = L b = L b = Ui(b) (> M b by hypothesis); if b = , then max(F b , M b ) = L b = Ui(b) 

as L is constant on [r , oo]. As A Gi(a) = 0 on {a < 6}, we are here left with 

E[Sj(r*) - Sj{a)\^ a ] = (1 - G^a-)) (U L (a) - L a ) > 0, (B.6) 
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with equality whenever dGi(a) > 0. On {a = b}, we collect terms to 


E[Sj(T*) - Sj (a) | jr a ] = A Gi(b)(U L (b) - M b ) > 0 (B.7) 


due to Ui(b) = ma x(F b ,M b ) > M b a.s., as we have argued before. On {b < r^}, dGf puts no 
mass on [6], On {b = t 3 }, (IB. 71) is binding iff A Gf(M — F) > 0 a.s. at t 3 < oo (for necessity 
of this condition for equilibrium note that A Gf{r 3 ) > 0 =>■ A Gf{r 3 ) > 0). This establishes 

4E2D- 

In the case that A Gf(M — F) = 0 a.s. at inf{f 6 R + | Gf(t) = 1} < oo, the identical 
arguments show that Gf = Gf is a best reply to itself, because then Sf is constant on 
hP’^(l),°o] (i-e., Sj(r 3 ) = Sj(b ) in flUZD). 

There are some slight variations to the above in proving that Gf is a best reply to Gf / Gf 
without the previous additional condition. The analogue to ()B.2I) that we seek is 



(1 ))-Sf(r) 



(B.8) 


for all stopping times r G [?9, with equality whenever dGf > 0. Afterwards we will 

show that at rf''^ ( 1) it is optimal to stop immediately. 

To derive (IB. 81) we can apply similar arguments as above. The main difference is that 
switching to Si = Sf and Gj = Gf while keeping b = t^’^( 1), we may have Gj(b) < 1. 
Nevertheless, A Gj(b)M b = AGj(b) max(F b , M b ) (in particular A Gj(b) = 0 on {b < r^} U 
{A Gi(b) = 0}), so that on the one hand Si(b) = Sj(T & ). Indeed, Gi = Gj on [0,6), so 
Siify-Sjir®) = (l — Gj(b))L b +(AGj(b) — AGi(b)) max(F b , M b ) = 0 on {b < r tf }n{AGi(6) > 
0} - the only set where they might differ - but there L b = F b (> M b by hypothesis) and 
Gj{b—) = Gi(b—). This implies payoff symmetry once we have (IB. 81) . On the other hand we 
get analogously to above (with possibly Gj(b) < 1) 


E[Si{b)-Si(a)\& a )=E 


f (L s + D l (s) - Mjj(s)) dGj(s) + AGj(b)(max(F b , M b ) - L b ) 
J [a, b) 


,,b) 

+ (1 - Gj (b-))(L b + D L (b)-M z (b)) 

- (1 - Gj(a -)) {L a + D L {a) - M L (a )) + A Gj{a){L a - M a ) 


(B.9) 




The integral vanishes as before. Since b is still the same, on {b < r tf } again F b = L b = Ujj(b) > 
M b : on {b = t' } } again ma x(F b ,M b ) = Ui(b) and A Gj(b) = (l — Gj{b—)). This eliminates 
the second and third terms. For any a < b, AGj{a) = 0, hence 


E[Si(b) - Si(a) \& a ] =(1 - Gj{a-)){U L {a) - L a ) >0, (B.10) 


with equality whenever dGi(a) = dGj(a) > 0. This proves (IB. 81) . 
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Let now a = 1) and b = r any stopping time taking values in (Tj- (1), oo]. It remains 

to show that E[Si (&) — Si (a) \ & a \ < 0 a.s. If a = r^’^l), then £*(&) — Si(a) = A Gj(a)(F a — M a ) 
and AGj(a) > 0 at a = = T f’^( 1) only if a, = t & , whence Si(b) — Si(a) = A Gi(a)(F a — 

M a ) < 0 by hypothesis. If a < T^' d ( 1), we have 0 < (F — L)dGj = (1 — Gj)lF>LdDi < 
(1 — Gj)dDi on [a, b). Together with A Gj{b)Mi < AGj(6) max(F&, M;,) we get “<” in (IB. 91) . 
On {b < r 1 '} we now have Lf, = L& < U^b ) and A Gj(b) = 0, so that “<” also replaces the 
equality in (IB. 101) . As now a = 1) < t^’^( 1), dD^(a) > 0 and U^(a) = L a . This finishes 

our proof. □ 


Remark B.l. Theorem 15.11 remains true if L is only upper-semi-continuous from the right 
(and the left), but L = M. Then D^ will be left-continuous (see footnote 1241) and there exists 
a feasible strategy Gf given by 


Gf(t) := 1 - exp< - 


dD c f 

1j 


(*) 


F s — L s 


-EM 


(AD-Js) 


\F S — L s 


+ 1 


for t £ [i?, t^), where D~ is the continuous part of Di and AD^(s) = D^(s+) — Di(s), and 
which satisfies 


dGf(s) = (l-Gf(s)) 


dDi(s +) 
F s - L s 


Then the only modifications in the proof are that we put dD^( ■ +) on the RHS of (IB. 31) . resp. 
the LHS (only!) of (IB. 41) . We do not have right-continuity of — L , but Jj a ^— L) dGi = 
0 still holds in (IB. 51) : The argument of footnote ESI applies to A D^, which has the same 
support as AG*. The continuous part dGf is absolutely continuous with respect to dD £, 
for which we can apply a change of variable similar to Lemma but with t d l(x) := 
inf{t > 0| Di(t) > such that = L a.s. at t d l(x), x £ M+, cf. (13.41) . Hence, 

E U[0,oo) 1 U l >L^ D l\ = fo E i 1 U L (t D l(x))>L(t D l (a;))] = 0. Finally we employ L = M to 

arrive at (IB.6D if AG(a) > 0. 


Proof of Theorem 15.31 We only need to establish time consistency. If the hypothesis holds, 
{(i? V d') < (r l9 A r^)} differs from{(d V d') < t § = t 0 '} := A £ a t most by a nullset. 

Only this event is relevant for time consistency, since (d V d') > (r' d A a.s. on A c and 
there is no restriction where Gf V Gf = 1. With L rl = L r ' a.s. on A, also 

ess sup E[L T (t') \ & T \ = ess sup E[L T (r') | ^ r ] a.s. 

T<T'e^ t<t'&3T 


on {r > (d V #')} D A for any r £ 3T, implying £/~*l t >(0 V 0') = 1 t >(^vi9') a - s - on A 

(i.e., the latter two processes are indistinguishable) by the uniqueness of optional projec- 

41 For (left-) continuous D i, t° l {x) < to Di(t) > x. 
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tions. Correspondingly, Df = D T ^ on [i? V d', oo] a.s. by the uniqueness of the Doob-Meyer 
decomposition. 

The time consistency condition in the equivalent form (l — Gf(t)) = (l — Gf (i? ; —)) (1 — 
Gf'(t )) reads for the given Gf 

i t< T » ex p(- / dD t K F ~ L )) 

= l^, <T ^exp(-/ dD T L * /(F - L))l & exp(- f dDf /(F-L)), 

- v 1 9 J 1 

which on A and for t > (d V i?') reduces to the true statement 
1 t< T » ex p(-/ dD r l/(F - L)\ = l t<T # exp(- / d.D'ff /(F - L)) exp(- f dD T ff /(F - L)) 

thanks to what we have shown before. The argument for j is analogous. □ 

Proof of Theorem 17.31 Consider the subgame starting at a given d € d? and let (Gf , af) 
and (Gf, af) be a pair as hypothesized. First note that t 9 < inf (t > d \ af (t) + af ( t ) > 0} = 
f 9 (cf. Definition lC.il) . such that Gf = Gf = 1 a.s. on [f^, oo]. All other feasibility conditions 
for the extended mixed strategies follow from those of Theorem 15.11 and Proposition 17.11 
Now let i, j £ {1,2}, i ^ j arbitrary in the following (not necessarily the roles assigned in 
the theorem) and consider player i deviating to some admissible (Gf,af). G 9 is continuous 
up to Tj := t G ’ l9 (1) = inf{f > 0 | Gf (t) > 1}, whence only Gf matters on [d,Tj) (cf. Definition 

EH): 

Vf(Gf,af,Gf,af) = V*(G >, l t > Tj af , Gf, af). 

So let af = 0 on \d, Tj) wlog. regarding optimality. Then Tj < inf{f > id \ af(t) + af(t) > 0}, 
implying 



by iterated expectations, with Gf 3 and Gf determined by time consistency (in particular, Gd 
arbitrary where Gf(rj-) = 1). Where Gf jumps to 1 before r , by construction F Tj > M T;j , 
so waiting is a best reply, e.g. playing Gd := 1 t>T #Gf and ad := 1 t>T &af. Therefore 


V? (G T d , af, Gf, af) < V? (l t>T «Gf, 1 t>T oaf, Gf , af). 


(B.12) 


Pasting Gf and Gd by time consistency yields 1 t< Tj Gf + 1 r . <t<T dGf(Tj—) + 1 t>T &Gf, which 
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in conjuction with 1 t>T &af is (weakly) better than (Gf, af) “by combining (IB.121) and (IB.111) . 
We may thus even assume af = 0 on [#, r") wlog. regarding optimality. 

On {r i} < f 15 }, M t & > F t $ by right-continuity, whence (Gj’’ , a]''' ) = (l t>r ^,af) and 
(GJ^, aj # ) = (1 t>T s , af) are mutual best replies with payoffs (GJ^ , af, G; f , af) = M r $ = 
rna x(F T ti,M T v). Proposition 17.11 shows that (G^,Oif) = (l t>T v,af) and (Gf 9 , af 9 ) = 
(1 t>T #,af) are mutual best replies on {t' 1 = f 15 } with payoffs Vf'' (GJ 9 , af . Gj' 5 , af ) = 
max(F r tf, M r $). For {i? = t t> } this directly implies optimality, while for {■$ < r”} we ob¬ 
tain the estimate 


Vf(Gf,af,Gf,af) < V?(l t<T *Gf + l t>T «,l t>T »at,Gf,af) 


by extending (IB.Ill) to r d with r d < infjf > $|af(t) + af(t) > 0} thanks to the previous 
steps. 

In summary, this means that for player i it suffices to verify optimality of Gf against Gf 
as (standard) feasible mixed strategies if we use the payoffs 


Vi(Ga, Gf) = E 



+ (1 - G£(r*-))(1 - Gj(r*-)) max(F>,M r „) 


=(1-G%(t<>)+AGZ(t»))AG?(t#) 



= E 


[ SfdGf 


)[ 0,oo] 

- 


for any feasible Gf, where we set Sf = f^^^FdGf + (1 — Gf (r' 1 —)) max(F r ,9 , M r g ) on 
[t^jOo], on which in particular Gf = 1; analogously for player j. This is however equivalent 
to the setting of Theorem ED with A Gf (F — M) = 0 a.s. at (note that A Gf (F — M) > 0 at 
1) = inf{t € M+ | Gf(t) = 1} since < inf{f >'d\M t > F t }), which proves optimality. 
Time consistency of G± and G 2 is obtained exactly as in Theorem 15.31 and holds trivially 
for oq and oq because af in Proposition 17.11 does not depend on 1 ? (except for the feasibility 
condition af =0 on [0,i?), of course). 

Finally, if either F T = M r or r = inf{f > r | Lt > Ft} when L T = F T , then the above 
is equivalent to the setting of Theorem 15.11 with the condition A Gf(F — M) = 0 a.s. at 
1) < r*. □ 

B.2 Proof of Proposition I8TT1 

We begin with Lemmata IB. 21 to IB.41 which establish some important necessary conditions 
for strategies and payoffs in payoff-symmetric equilibria. They are crucial for the subsequent 
proof of Proposition 18.11 
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Lemma B.2. In any pay off-symmetric equilibrium, for any d G ST 


f 1 L#F (1 -Gt)dGf= f - G'l) dC4 

for all t G M_|_ a.s. (with Gf_ the left limit Gf — AGf ) and hence 

dGj iGj 

L * F (l-Gi_) L * F (l-G°_y (B 3) 

which is to be interpreted as “ = 0 ” if (1 — Gf_)(l — G f_) = 0. Both representations also hold 
with the right limits (1 — Gf), (1 — Gf). 

Proof. First consider any with d < r < t 9 = inf{f > d \ af(t) + af (t) > 0} a.s. Time 

consistency and iterated expectations imply for i,j G {1,2}, i j, 


Vf (Gf ,af,Gf, af) = e\ [ (1 -Gf)LdGf + f (l — Gf)F dGf + ^ MAGfAGf 

G[o,r) Po,t) 

+ (1 - G?(r-))(l - Gf(r-))^(G[,aJ,GJ,aJ) | J** 

/ (1 -Gf_)LdGf + f (1 -Gf_)FdGf (B.14) 

J[0,t) J[0 ,t) 


+ ^ (M - L - F)AGfAGf 

[0,t) 


+ (1 - G?(r-))(l - Gj (r—)) Vff (Gf ,af,GJ,aJ) 




In a payoff-symmetric equilibrium Vf — V]' = V{ — VJ = 0, hence 


E 


f (l-Gf_)(L-F)dGi 

J[0,t) 



-Gf_){L-F) dGf 




= 0 


for any r G [d,f 9 ]. The integrals represent two signed optional random measures^ which 
agree on all optional sets in [0, f^) (trivially on [0,r?) by Gf{d—) = 0; the optional <T-field is 
generated by the stochastic intervals [0, r), t G IF). L and F are optional processes, hence we 
may cancel (L-F)ff 0 to observe two left-continuous (thus optional) processes lp^p(l — 
Gf_) dGf and /[ 0 ^ 1l^f(1 — Gf_) dGf that agree in expectation at any stopping time t < f 9 . 
They are thus indistinguishable up to f 9 by the uniqueness of optional projections. 

The two measures Jj 0 ^ — Gf_) dGf do not charge (f^,oo), where Gf V Gf = 1. 

For the remaining set [f 11 ], suppose f° = ff (cf. Definition 1C. IF so Gf ( t 9 ) = Gf° (f 9 ) = 1 by 

% t] (l — G 2 -)(L — F) dGf and J" (1 — Gf_)(L — F) dGf are adapted, right-continuous and of finite 
variation. Their minimal decomposition is using (L — F) + and (L — F)~ , respectively. 
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A y Q A ^ A ^ 

time consistency. Now, if GJ (f w ) = 0, V.f — F^a and VJ = L^a, implying L-a = F-a. 

If GJ^(t^) 6 (0,1), then, by Riedel and Steg (2014), Section 3, i is indifferent between 


stopping and waiting, i.e., Vf" = F f a = af(f^)M fl ? + (1 — a^(f & ))L-a. On the other hand, 




Rf 9 = Gt*(t°)[af(t°)M^ + (l-c$(f*))F f *] + (l-G$*(t*))L f *. If > F>, j would set 
of (f^) = 1, contradicting the indifference of i. If M^a < F-a , af (f^) = 0 and hence L^a = F^a 
by the indifference of i. Finally, if M f a = F r », R 1 "' 9 = = R^ 19 = Gf {f®)F-a + (l — 

GJ^(f^))Lfa, and hence again L^a = FR;. In summary, we must have GJ^(t & ) = 1 if L?a ^ 


F^ and then by time consistency AGf(f^) = 1 — Gf (f —) and AGf (f”) = 1 — Gf(f —), 
which makes our measures above agree on all of [0, oo). 


The representation (IB.131) is obtained by integrating [(1 — Gf_)(l — Gf_)] on {Gf_ V 
Gf_ < 1} w.r.t. each measure. Finally, 1 l^f( 1 — Gf) dGf = 1 (1 — Gf) dGf is obtained 

analogously, skipping the step (IB.141) . □ 


Lemma B.3. Gf = Gf on [d, inf{t > d | 1 L=F (dGf + dGf) > 0}) a.s. 

Proof. It is easier to work with 1 l ^ f (1 — Gf) dGf = 1 l ^ f (1 — Gf) dGf, i,j € {1, 2}, i j. 
On the interval [d, inf{t > d | l_L=F(dGf + dGf) > 0}) we can ignore 1 l^f, whence we now 
get Gf (d) = Gf(d) and, while Gf < 1, dGf = 0dGf with 


1 — Gf 
l-Gf 


(B. 15) 


By rearranging (IB. 151) . for all t <E [d, inf{t > d | lz,=F(dGf + dGf) > 0}) D {Gf (t) < 1}, 
Gf(f) = 1 - (1 - Gf (t))& = Gf (0) + [ (fdGj 

J(o A 

(j) t — l = Gj (t)(/) t — G'j (0) — [ cfdG f 

7(0,t] 

<j) t ~ 4>o = [ Gf_ d(f>. 


The last line is obtained by integration by parts, as f> is right-continuous while Gf <1. It 
implies that f must indeed be constant where Gf < 1. If Gf jumps to 1 where L F then 
(l — Gf)AGf = (l — Gf)AGf = 0, i.e., Gf must attain 1, too, which completes the proof. □ 

Lemma B.4. Suppose AGf(d) V AGf (d) > 0 in a pay off-symmetric equilibrium. Then 


F$ < max(L,?, M*) and R/ = Rf 5 < AGf (d) max(F^, M$) + (l - AGf (d))L tf 


for j = 1,2, a.s. 

Proof. To simplify notation, note that AGf(d) = Gf(d), i = 1,2. First consider d < f^. If 


40 












Gf(t?) A Gf($) > 0) either player i is willing to stop, so V'f = Gf ('d)AF) + (1 — Gf (i?))L# > 
Gj(d)F^ + (1 — Gj($))L$ and hence M# > it#, a.s., proving the claim rd 

If Gf (i?) A Gf(t?) = 0, then Lemma fB.21 (resp. equation (IB. 131) 1 implies L$ = F$ a.s. The 
player i with Gf(i?) > 0 is willing to stop, so Vf = L# = Gf(fl)F$ + (1 — Gf(d))L^ = Vf > 
Gf(d)M^ + (l — Gf('d))L^. The second equality just applies L$ = F$ and the estimate is 
player f s opportunity to stop (it now shows also F# > M$, a.s.). 

With extended, mixed strategies, suppose now i? = ff (cf. Definition 1C.II) . so Gf(i?) = 1. 
Then, by Riede l and Steg (2014), Section 3, Vf = max{F$, dj($)M# + (1 — dj{d))L^}. On 
{F$ > max(Af, 9 , L$)}, player i would ensure to become follower to realize Vf = F$, whence j 
would become leader for sure with Vf = L$ < F$, contradicting payoff symmetry. 

Now suppose Vf > rna x(F$, M#). Then we would have Vf = of + (1 — of (t?))L# > 

F$, and i would ensure not to become follower. Then j would for sure not become leader 
and Vf < max(F,), M$), contradicting payoff symmetry. This proves the claim for i. If also 
d = 7f and hence Gf (i9) = 1, reversing roles shows that V- 1 < max(F, 9 , M#). If d = rf < rf, 
Vf = Gf(d)[a?(d)M#+(l-a»(d))F#]+(l-Gf(d))L^ < Gf^)mBx(F^M^) + (l-Gf^))L 6 , 
finishing the proof. □ 


Proof of Proposition I8TT1 Fix d £ ST. By optimality we must have for any r > d with 
Gf(r—) < 1 a.s. that 

in any equilibrium. By time consistency and iterated expectations, 




$ / /IT 


. $ /^rd i9 
t j 5 


) = E 




F dGj + (1 — Gf(r—))Vf(G{ 


a,- 




^9 


(B.16) 


Now define the stopping times 


t(x) := inf {/ > d | lF>i(c/Gf + dGj ) > 0} A inf{f > $ | Gf V Gf > x} 

for x € [0,1), and their monotone limit f (1) := lhn x ^j f (x). Then (IB. 161) holds for all f(x), 
0 < x < 1, with FdGj = (L A F) dGj. The integral converges as x -A 1, also in expectation 
because F is of class (D). 

First consider the set {Gf(f(l)— ) V Gf(f(l) —) < 1}, where Vf lx> — > Vf^ a.s. Further, 
if Gf V Gj < 1 at f(l), then 1 F>L(dGf + dGj ) > 0, hence Ff(i) > Lf(i) an d f(l) is a point 
of increase for either Gf or Gf so at least one player is willing to stop. If there is no jump, 
then = Vf = Lf(i)- If there is any jump, then Ff(i) = Lf(i) an( f < Ff(i) by 

43 Suppose that Gf(r*) > 0 in an equilibrium, G > r* £ ST, so E^Sf(r*) | = ess sup^ <Tg g- E^Sf (r) | 

> lim^o E [Sf (r* + e) | . As L is right-continuous by assumption, we have Sf (t+) — Sf ( t ) = 

{Ft - Mt)AGj (t). Since Sf is of class (D), lin^o E[Sf {t* + e) \ J^] = E[sf(r*+) \ hence (F T . - 
M T *)AGj{r*) < 0 a.s. 


41 


















Lemma IB.41 and the hypothesis M < F. On the other hand, if Gf V Gf = 1 at f(l), then 
G[( 1 )(f(l)) VGj^(f(l)) = 1, hence < ^(l) < bv Lemma IB. 4 1 and M < min(F, L). 

Now consider the set {Gf(f(l) — ) V Gf (f(l)—) = 1}, which means that one distribution 
has fully grown on {F < L}. Then in fact Gf (f(l)—) = Gf(f (1)—) = 1 by Lemma fB. 31 Thus, 
(l — Gf(f(x)—))Vf^ x ' > —> 0 a.s. because (yL^) Q ^ is uniformly integrable (see Lemma IA. 71) . 

In total, we have that (l — Gf (t(x) -))vT x) in (IB.161) converges a.s. and in expectation 
to a limit bounded by (1 — Gf (f (1)—)) (L^) A F^), which completes the proof. Note that 
we can also state the result in the form 


Vf{Gf,af,Gf,af) < A Gf(0)F# + (l - AGf(V))U LAF (V) < U L af(0). 


□ 


B.3 Proof of Theorem 18.21 

Proposition 18.11 shows that in a payoff-symmetric equilibrium, the continuation values are at 
most 

A Gf($)F# + (1 - A Gf(#)) ess sup E[L r A F r \ < U LAF (V), 


$<Te& 


in fact considering only stopping times r < inf{f > d \ Gf V Gf = 1}. However, if L > 
Ulaf and Gf < 1, player V s value Vf would be at least A Gf^F^ + (1 — A Gf(i)))L^ 
by considering the limit of stopping at i? + £ with e \ 0 (for convergence, cf. footnote 
H51) ; a contradiction. Therefore the game beginning at i? cannot continue past to($), and 
the continuation values are actually bounded by U^ LAF y 0 (o) < Ulaf • This argument of 
course iterates, inducing decreasing sequences of stopping times ( T n {d)) n and Snell envelopes 
(^(LAF) 7 ‘”(’ , )) n ’ as one introduces increasingly strict constraints. The decreasing sequence of 
stopping times is bounded below by $, so the limit f($) = linin^oo T n ($) exists and is a 
stopping time as well. 

It is convenient to have optimal stopping times that attain U^ LAF y n (^), respectively. They 
exist because each process (L A F) 7 "^) is clearly right-continuous and of class (D), but also 


upper-semi-continuous in expectation under Assumption 12.1 (iii 


These optimal stopping times simplify the argument to prove f ($) < inf{t > t? | L > 
Suppose to the contrary that there is a stopping time a € [$, r(i?)] with L a > 
U(LAF) f w( a ) with positive probability, i.e., L a > E[L t A F t | J^o-] for all r £ [cr, f ($)]. Since 
L a A ^{ LAF )TnW (c) a.s. for each n £ N, there must exist stopping times a n £ [f(i?),r n (i?)] 
such that L a < E[L an A F an | JF CT ], Necessarily cr n —>• f(i?) a.s., and by right-continuity and 
L being of class (D), L a < E[L f ^ A F f ^ | ■'F a \ < U^ LAF y^){a). Thus there is no threat of 
preemption on [?9, f($)) if U^ LAF y^ is the continuation value. 

f(i9) is indeed maximal by construction. Whenever we have another f > •& such that 
L < U( LAF y on [i?,f), then f < r 0 (i?) a.s., because U^ LAF y < U L af • So U {LAF y < U^ LAF y oW 
and by iteration t < r n (d), n € N. 
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Let us remark some further regularity properties. Since U^ LAF y n ^) > (I AF) T " -^, we can 
only have L > U^ LAF \ Tn w if L > F, thus L > F by right-continuity at all Tk(i9), k £ No, and 
at f(?9), too. Moreover, 


f (i?) = inf (i > f(i?) | L* > a.s. (B.17) 

because we would otherwise have L = L A F < U^ LAF y k w for all k £ No until the stopping 
time on the right. By construction we have f(i? 7 ) = f(i?) a.s. on {d 7 £ [$, f(i?)]} for any 
d' £ <!?, hence f(i9) is monotone in i9 and 

t(i9) = lim f(i} n ) a.s. (B.18) 

n—too 

for any sequence d n \ i?, C Indeed, (IB.181) also holds on {$ = f($)}: If we denote 

the limit by f 7 > f($), then f{d n ) = f 7 on {d n < t 7 }, and therefore, using that U, LAF y is a 
sup er mar t ingale, 


U (L/\F) fl (#)>E[U (LAFr (# n ) \&»\ 

— F[F$ n l$ n <f' T Ff '| 7^"i?] 

for any n £ N. The term on the second line goes to L$ on {i? < f 7 } as n —>■ oo, thanks to L 
being right-continuous and of class (D). This argument implies L < U, LAF y on [$,f 7 ). By 
the maximality of f (i?) shown before, f($) = f a.s. 

Now define the process L by 


u 


L t t < t (i?) 

Fr(tf) t > f (l?) 


with Snell envelope U F and its compensator D F . From > F f ^ we see that ( LAF) T ^ < 
L < U( LaF ^t( 0 ) < U F on [i9, oo]. Since the Snell envelope is the smallest supermartingale 
dominating the payoff process, the last inequality must actually bind and thus also D F = 
D (LAFfW on [$,oo]. By this observation we obtain 


/ 1l>f dD F = ^2 AD f = 0 a.s., (B.19) 

[0,oo] 

which we need to apply Theorem 15.11 for d and = f($). Indeed, the path regularity of 
(L A F) T ^ verified before implies continuity of F)^ LaF \?(■&) and hence that L < U, LAF y(&) = 
LAF, i.e., F > L at any point of increase of dD^ LAF y^) on [$, f($)). Furthermore, F)^ LaF \?(#) 
is right-continuous and flat from f(i?) on, so it does not charge [f($),oo], a.s. 
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With (IB.191) and (IB.171) we can now define supporting equilibrium strategies (Gf, af) 
and (Gf, oQ) as follows. First, if we concentrate on a fixed z9, it is enough to set, for any 

■— (f 9)))®i 


with af as in Proposition 17.11 z = 1,2. Now pick z, j G {1,2}, z / j, and define Gf := Gf, 
GJ := GJ as in Theorem 15.II with r' 7 := inf{t > cr | (l —1 (L t — F t ) > 0} (= r(z?) a.s. on 

{a £ [r9, w('i9)]} by (IB.17D 1. These strategies form a subgame-perfect equilibrium analogously 
to Theorem 17. Ml with payoff (&') = at any z?' £ [z?, f(z?)], since f(i? / ) = f(z?) 

a.s. (note that the endpoint condition A Gf(F — M ) > 0 at = inf{i G M+ | Gf(t) = 1} 

is void thanks to M < F). 

If we want the respective maximal payoff to be attained in every subgame in one equi¬ 
librium, we need to be careful how to aggregate the random intervals [z?, f(z?)) - on which 
to suppress a i - in a measurable way across all z9 G 8T. Therefore, begin with the (op¬ 
tional) set A := Uged+Ifb (<?))• Now approximate any given z? G 2F from above by the 
decreasing sequence (z?„) ne N, where z? n := 2~ n |~2 n z?~| G 3T. Any i9 n takes values only in 
{k2~ n ]k £ N} and thus [z? n ,f(z9 n )) = J2km[^~ n ^(k2~ n ))l^ n=k2 -^ a.s., which means that 
[z? n , f (z9 n )) belongs to A up to a nullset. Further, z? n < z? + 2~ n implies f(z?) = f(z9fc) for all 
k > n on { t ( z 9 ) > z9 + 2~ n } a.s., and therefore on this set (z?, f(z?)) = U k>nWk,f('&k)) a.s., 
which means that (z9,f(z?)) belongs to A on {f(z?) > z? + 2~ n } up to a nullset. Aggregating, 
(z?,f(z?)) = (z?,f(z?))(l f (^)> 2 -i +En lf(i?)e[i?+ 2 -"- 1 ,i?+ 2 -")) a.s. and hence belongs to A up to 
a nullset. 

Now define 

af := (lim sup l u eA c ) oif 
^ u \t 


for any z9 G 3T, with af as in Proposition 17.11 z = 1,2. df does not d epe nd on z? and is still 
progressively measurable by Theorem IV.33 (c) in Dellacherie and Me yer (1978). Further, 
af = 0 on [z?, r(z?)) a.s. It remains to verify right-continuity (where af < 1) for a subgame- 
perfect equilibrium along the previous lines, and that f(z?) = inf{f > f(z?) | af(t) > 0} a.s. 

Consider the set {lim sup^s^ l n eA c / lim inf^x^ l u gA c } for some cr G 5. We cannot 
have L a < F a on that set with positive probability, because then we would have f(a) > 
inf{t > a | Ft > L t } > a and lim sup M VfT l u eA c = 0. We cannot have L a > F a with positive 
probability, either, because then lim^y^ l u &A c would exist. Indeed, we would have 


f(q) > inf{t > a \ F t - F a > (L a - F a )/3 or L t - L a < -( L a - F a )/ 3} =: a' 


for all [i q,f(q )) D (<r, a') / 0 a.s. by L a > supr^/TL A F ), which means that there cannot 
be shorter and shorter intervals in A as we approach a from the right. In summary, we only 
possibly affect right-continuity of af < 1 by moving to af where L = F and af > 0, but 
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there af = 1 by definition. 

Finally, if we consider a (rationally valued, as above) sequence \ then (IB.181) 

implies lirn sup u \^ f 1 u£ a c = 1 a.s., and (IB.171) implies f($) = inf {i > f($) | af(t) > 0} a.s. 
by the definition of af. 

Now we can define Gf := Gf, Gf := Gf as in Theorem 15. II with t' 9 \= f(d) for any d € &. 
These strategies form a subgame-perfect equilibrium analogously to Theorem 17.31 with payoff 
u (laf)H+W = Ui{d) at any ilel □ 


C Outcome probabilities 


The following definition is a simplification of that in IR iedel and Ste g (2014), resulting from 
right-continuity of any af(-) also where it takes the value 0. 

Define the functions /jll and mm from [0, l] 2 \ (0, 0) to [0,1] by 


»L(x,y) := 


- y) 

x + y - xy 


and n M (x,y) := 


xy 


x + y — xy 


Ml («*,«;) is the probability that player i stops first in an infinitely repeated stopping game 
where i plays constant stage stopping probabilities a, and player j plays constant stage 
probabilities aj. yM(ai,<ij) is the probability of simultaneous stopping and 1 — M£( a o a j) ~ 
Mm (a*, aj) = /j,L(a,j,ai ) that of player j stopping first. 

Definition C.l. Given d € ST and a pair of extended mixed strategies (Gf , af) and (Gf, af), 
the outcome probabilities Af l5 Af 2 and Af} at := infjt > d | af (t) + af (t) > 0} are defined 
as follows. Let i,j £ {1,2}, i ^ j. 

If t* 9 < Tj := inf {t > $ | af (t) > 0}, then 


+ := (1 - Cf+—))(1 — ++)), 
+ := (1 - Gt+~))at(rO)A+(rO). 


If t* 9 < t f := inf {t > $ | af (t) > 0}, then 


A l,i ■= (1 - G}(?*-))( 1 - a J (f^)AGf(^), 
A:= (1 - Gf (r»-))af (r»)AGf (A 

If = ff = ff and af (f 19 ) + afCr 19 ) > 0, then 


A?,* := (1 - Gf (fM)(l - Gf(t»-))y L (at(f»),af(?»)), 
Am := (l-Gf(r l9 -))(l-Gf(r l9 -))MM(af(f’ 9 ),af(r’ 9 )). 
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If f ^ = ff = fvf and af( t®) + afC^) = 0; then 

Al j :=(l-Of(f< , -))(l-Gf(f <, -))I{ liminf M (a f(t),af(t)) 

af(i)+a*(t)>0 

+ limsup /ii(af (f), aq(f)) >, 


t\f® 
af(t)+a^(t)>0 

Am : = 0. 

Remark C.2. 

(i) Am is the probability of simultaneous stopping at f rl , while A^ i is the probability of 
player i becoming the leader, i.e., that of player j becoming follower. It holds that 
A^+A^+A^. = (1-Gf(^-))(1-G^(^-)). Dividing by (i-G?(f*-))(l-Gj(r*-)) 
where feasible yields the corresponding conditional probabilities. 

(ii) If any af = 1, then no limit argument is needed. Otherwise both af are right-continuous 
and the corresponding limit of /im exists, /jll, however, has no continuous extension at 
the origin, whence we use the symmetric combination of lim inf and lim sup, ensuring 
consistency whenever the limit does exist. If the limit in a potential equilibrium d oes not 
exist, both players will be indifferent about the roles; see Lemma A.5 in Riedel and Steg 
(2 0141 ). 
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